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A PROTEIN REGION RESPONSIBLE OF BINDING TO EPITHELIAL CELL TYPES 
AND A DNA SEQUENCE ENCODING SAID REGION 

The present invention relates to DNA molecules encoding polypeptides responsible of 
5 binding to human and/or animal epithelial cell types, like intestinal, urogenital and/or 
. endothelial cells. Further this invention relates to vectors containing the DNA molecules and 
hosts transformed with the DNA molecules of this invention. 

The present invention relates also to a method of constructing new hosts or new proteins 
10 capable of binding to human and/or animal epithelial cell types. 

This invention also relates to genes encoding preselected proteins modified to bind to 
human and/or animal epithelial cell types. 

15 In addition this invention relates to a host cell, to a protein and to a method for carrying 
preselected factors/properties to human and/or animal epithelial cells or cell surfaces by 
using a Lactobacillus brevis strain or L. brevis Sip A protein. ' 

Bacterial adhesion to human epithelial and subepithelial tissues is a decisive initial event in 
2 0 successful colonization of tissue sites by invading bacteria. Several molecular ligand- 
receptor interactions have been characterized for bacterial species that cause infectious 
diseases in man or animals. Adhesion of pathogenic bacteria to the tissue at the infection 
site helps the bacteria to resist mechanical defences of our body, such as peristalsis in the 
intestine or flow of urine in the urinary tract Adhesion is a key determinant in the host, 
2 5 tissue and cell-type tropism of bacterial infections. Attachment to tissues is also important 
for those bacteria that establish themselves as members of the normal bacterial flora in the 
human body. Species of Lactobacillus are major members of the indigenous bacterial flora 
in the gastrointestinal and the genital tract of man and animals. Lactobacilli are thought to 
be beneficial to their host organism and have a long history of use in the gastrointestinal and 
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the urogenital tract to prevent or cure various minor illnesses. Probiotic effects of 
lactobacilli include exclusion of invasive pathogens from intestinal and vaginal surfaces, 
production of antimicrobial substances, stimulation of immune systems, as well as other 
physiological effects. As lactobacilli are members, of the normal bacterial flora and food- 
grade organisms, their possible use as carriers of vaccine antigens in the intestine has 
aroused interest. The mechanisms by which lactobacilli bring about the probiotic effects 
have remained uncharacterized, but it is generally agreed that efficient adhesion to epithelial 
surfaces is important for the colonization of the intestine as well as for the effects associated 
with these bacteria. Isolates of lactobacilli have been shown to adhere to the intestinal 
epithelium of their hosts (Goconnier et al. y Appi. Environ. Microbiol 58:2034 - 2039 
(1992) and Henriksson et aL, Appi Environ. Microbiol 57:499 - 502 (1991)), but to date, 
however, no molecular ligand-receptor interactions of lactobacilli have been characterized. 
Considering the high number of lactobacilli in our body and the biotechnological, health- 
associated as well as ecological importance of these bacteria, molecular characterization of 
their adhesion mechanisms is important. 

S-layers are paracrystalline surface protein arrays that are commonly expressed by species 
of Eubacteria and Archaebacteria (reviewed in Messner and Sleytr, Adv. Microb. Physiol. 
33: 213 -275 (1992) and Sleytr and Sara, Trends Biotechnol. 15:20-26 (1997). Most S- 
2 0 layers are composed of a single protein species, the S-layer protein, greatly varying in size 
in different bacterial genera. The S-layer subunits are very hydrophobic and crystallize to 
form a two-dimensional layer on the bacterial surface. The genes encoding S-layers are 
efficiently transcribed, and the S-layer protein is the dominant protein species representing 
1 0-20% of the total cellular protein of the bacterial cell. Differing functions have been 

2 5 attributed to the S-layers of different bacterial species. These functions include maintenance 

of the cell shape, protection of cells from hostile environment, anchorage of extracellular 
enzymes to the bacterial cell wall and mediation of bacterial attachment to animal tissues 
(Chu et al, J: Biol. Chem.266 .15258-15265 (1991), Schneitz et aL, J. Appi. Microbiol. 74: 
290- 294 (1993) and Toba et al. 9 Appi. Environ. Microbiol. 61:2467-2471 (1995)). The S- 

3 0 layer of the fish pathogen A eromonas salmonicida binds to extracellular matrix proteins and 

increases bacterial virulence by promoting bacterial spread to cause systemic infection in the 



10 
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fish (Chu et al, J. Biol C7?e/?2.266:15258-15265 (1991)). Most S-layer proteins aggregate 
in physiological buffers and their functional analysis have been restricted to solid phase 
assays (Sleytr and Sara, Trends Biotechnol 15:20-26 (1997)), which has remained a severe 
limitation in the functional analysis of these important surface proteins. 

5 

S-layers are expressed by various species of the genus Lactobacillus (Masuda and Kawata 
FEMS Microbiol Lett 20:145-150 (1983). Their role in bacterial adhesiveness to chicken 
epithelium as well as to human and mouse extracellular matrix have been suggested 
(Schneitz et al, J. Appl Microbiol 14: 290- 294 (1993), Toba et al, Appl Environ. 

10 Microbiol 61:2467-2471 (1995)), but overall, the functions of lactobacillar S-layers have 
remained poorly characterized. Primary structure of a few lactobacillar S-layers have been 
determined (Vidgren et al 1 Bacteriol 174: 7419 - 7427 (1992), Boot et al, J. Bacteriol 
175: 6089- 6096 (1993) and Boot et al, J. Bacteriol 111: 7222- 7230. The predicted 
lactobacillar S-layer proteins are 40 to 50 kDa in molecular size and show similarity in 

15 amino acid compositions. 

Lactobacilli are important bacterial colonizers of our intestinal surfaces. -Despite their high 
number and potential symbiotic effects in our body, our knowledge of the colonization 
mechanisms that the lactobacilli use to attach and multiply our intestine have remained 
2 0 uncharacterized. This has been in part due to the restricted methodology to genetically 

manipulate these bacteria and also due to the lack of suitable methods to study the binding 
mechanisms of these bacteria to intestinal or other mucosal surfaces. 

WO 97/14802 suggests the use of Lactobacillus fermentum 1 04R 29 kD adherence factor . 
2 5 for promoting the activity of microorganism cells to bind to a receptor recognized on 
mucus. However, the finding of a factor capable of binding to mucus does not solve the 
problem of specifically carrying preselected factors to human or animal epithelial cells. 
Mucus on the surface of intestinal tract and any factors bound to the mucus are easily rinsed 
out from the intestine. 

30 

In this invention a DNA molecule encoding a protein region responsible of binding of the 
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protein to intestinal, urogenital, endothelial and/or other epithelial cell types has for the first 
time been identified and characterized. Although there has been some preliminary notions 
about the possible binding capability of the S-layer proteins of lactic acid bacteria (LAB) to 
human or animal cells, the binding property has not been confirmed to be due to the S-layer 
5 protein. The DNA molecule encoding a protein region responsible of binding was 

unexpectedly found from a gene encoding the S-layer protein of a species of lactic acid 
bacteria, Lactobacillus brevis. However, according to this invention a homologous DNA 
molecule encoding similar advantageous binding properties, may be synthetic or 
semisynthetic or originate from the same or another group of microorganisms. 

10 . 

This invention results in various advantages. This invention makes it for the first time 
possible to modify or improve the binding capacity of varipus prokaryotic or eukaryotic 
cells to human or animal epithelial cell types, like intestinal, urogenital and/or endothelial 
cell types by using lactobacillar surface structures. In particular, it is possible with the 
15 nucleotide sequences of this invention to improve the binding properties of a host cell 
having probiotic effects to human and/or animal epithelial cell types. 

It is possible with the nucleotide sequences of this invention to modify or improve the 
binding properties of a preselected protein to human and/or animal epithelial cell types. 

2 0 

Furthermore it is possible by gene technological means to modify a host cell having or 
being modified to have the binding capability, to carry antigens, advantageously vaccine 
antigens to the intestinal and/or urogenital tract of humans and/or animals. In particular, it is 
possible to modify strains of Lactobacillus brevis species. 

25 

The probiotic properties of various strains of Lactobacillus brevis species can be improved 
by genetic modification. Totally new properties can be transferred to the strains of 
Lactobacillus brevis species or other hosts of this invention, which may carry these 
properties to human or animal gastrointestinal or urogenital tract. 

30 

Various host cells having or being modified to have the binding capability can be used to 
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colonize the human or animal gastrointestinal or urogenital tract and to exclude pathogens 
by binding to cell receptors, which would otherwise be bound by pathogens or by growing 
to. cell surface areas which would otherwise be colonized by pathogens. 

5 Next, the invention will be described in more detail with the aid of the attached figures, 
of which 

Figure 1 . Effect of S-layer removal on the adherence of I. brevis ATCC 8287 to human 
Intestine 407 cells. Panel a shows adherence of untreated bacteria to the intestinal cells, and 
10 panel b shows the adhesiveness of bacteria treated with guanidine hydrochloride to remove 
S-layer from the bacterial surface. 

Figure 2. Quantitative analysis of the effect of S-layer removal on the adherence of L.brevis 
to Intestine 407 cells. Panel A shows the number of adherent bacteria per epithelial cell; the 

15 adhesion test with the treated and the untreated bacteria was performed at four different 
bacterial cell densities indicated below. Panel B shows the polyacrylamide gel electro- 
phoresis in sodium dodecyl sulfate (SDS-PAGE) of the proteins released from the bacterial 
surface by the Laemmli sample buffer used in SDS-PAGE, The S-layer peptide is indicated 
by the arrow. The S-layer peptide is the dominant peptide species released from the cells but 

2 0 not the only one. 

Figure 3. Schematic representation of the SlpA fragments expressed as fusion to flagellin. 
On top, the hydropathicity plot of the entire 465-amino acid SlpA peptide, The most 
probable antigenic determinant is indicated by the dashed line, and the SlpA peptide has a 

2 5 signal sequence of 30 amino acids. Below, the bars indicate the fragments expressed in fliC, 

the numbers refer to the N- and C-terminal amino acids in the SlpA peptide. Binding of the 
chimeric flagella to Intestine 407 (Int 407) cells is indicated on the right. 

Figure 4. Binding of the SlpA96-200/ FliC chimeric flagella to the human Intestine 407 

3 0 (Panel a) and urinary bladder T24 cells (Panel c). The binding was visualized by indirect 

immunofluorescence. Panels b and d show the corresponding fields by light microscopy. 
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Panel e shows the binding of the a FliC flagella to Intestine 407 cells, and Panel g the 
control staining without flagella; the corresponding light microscopic fields are shown in 
Panels f and h. Arrows indicate binding of the chimeric flagella, arrowhead indicates lack of 
binding. 

Figure 5. The nucleotide sequence of Lactobacillus brevis SlpA gene and the corresponding 
amino acid sequence. The first and last nucleotide residue of the coding sequence and of the 
various fragments is marked with an arrow. Correspondingly the first and last amino acid 
residue of the entire peptide and of the fragments is marked with a circle.. 

DNA molecules of the invention 

In this invention a DNA molecule encoding a protein region responsible of binding to human 
and/or animal intestinal, urogenital, endothelial and/or other epithelial cell types has for the 
first time been identified and characterized. 

In this invention it has been found that the S-layer protein SlpA of Lactobacillus brevis 
has adhesive properties to epithelial cell types. The primary structure of this protein and the 
corresponding gene has been described in WO 94/00581 and in Vidgren el al '., J. 
Bactehol. 174: 7419-7427 (1992). According to Palva, A in EL Bahl et al./, FEMS 
Microbiology Reviews 20 (1997):47 - 98 preliminary results have indicated that L. brevis S- 
layer protein could mediate binding to intestinal epithelial cells. However, until this date 
there has not been any suitable method to study the role of S-layer proteins of lactobacilli. 
For instance Schneitz et al. J. Appl Microbiol 74: 290- 294 (1993) have reported that the 
S-layer of L. acidophilus was involved in the adhesion of these bacteria to avian intestinal 
epithelial cells, but chemical removal of the S-layers did not affect the adhesion to Caco-2 
cells. Guanidine hydrochloride extraction, which was used in the experiments, removes 
many protein types in addition to S-layer and hence the role of lactobacillar S-layers can not 
be deduced from these experiments. 



By using a recently introduced flagella display system (Westerlund-Wikstrom et al. Protein 
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- Engm. 10: 1319 - 1.326 (1997)) we studied the adhesive properties of the S-layer protein 
SlpA of Lactobacillus brevis. We found, surprisingly, that the full length but also very short 
regions of the Lactobacillus brevis S-layer protein were capable of binding to epithelial cell 
types. The predicted size of the full-length S-layer protein is 465 amino acids. In the present 

5 invention various fragments of slpA were expressed as gene fusions in the variable region of 
they7/C H7 gene of Escherichia coli. The resulting chimeric flagella were assessed by indirect 
immunofluorescence for binding to various epithelial cells. 

The S-layer peptides needed for the binding were contained in fragments of 270, 215, 275, 
10 150 and 105 amino acid residues respectively, the shortest fragment representing residues 
96 through 200 in the S-layer protein. However, any fragment being a partial amino acid 
sequence of these sequences or of the entire Lactobacillus brevis S-layer protein and 
possessing similar binding capacity as the above mentioned fragments is a polypeptide of 
this invention and any DNA molecule encoding these polypeptides is a DNA molecule of 
15 this invention. 

Chimeric flagella harbouring inserts that represented the N-terminal part of the S-layer 
protein bound to intestinal as well as urinary bladder cells, whereas the C-terminal part of 
the S-layer protein did not confer binding on the chimeric flagella. However, the C-terminal 

2 0 parts of the protein may have an effect in enhancing the efficiency of binding. 

The S-layer expressing bacterium Lactobacillus brevis ATCC 8287 efficiently adhered to 
the human small intestinal cell line and to the human urinary bladder cell line. 
Bacterial adhesiveness to both cell lines was completely abolished after removal of the S- 
25 layer protein (SlpA) from the bacterial surface by guanidine hydrochloride extraction. 

This invention is directed to a DNA molecule encoding a polypeptide capable of binding to 
human and/or animal epithelial cell types, like intestinal, urogenital and/or endothelial cells. 
The cells may originate from human or animal origin, like from porcine or poultry origin or 

3 0 from pet animals. The cells may be normal or e.g. tumour cells. The DNA molecule may be 

a DNA molecule having the frill length or the partial sequence i.e. the coding sequence 
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contained in the nucleotide sequence of any one of SEQ ID NO. 1, SEQ ID NO. 2, SEQ ID 
NO. 3, SEQ ID NO. 4 , SEQ ID NO 5 or SEQ ID NO. 6, representing the various 
fragments of the Lactobacillus brevis slpA gene, excluding, however, the full length SEQ 
ID NO. 6. 

By a partial nucleotide sequence is meant a nucleotide sequence lacking at least one 
nucleotide residue as compared to the nucleotide sequences of SEQ ID 1 to SEQ ID 6. 

SEQ ID NO. 1 represents the 3 15 nucleotide residues from 286 to 600, SEQ ID NO. 2 
represents the 450 nucleotide residues from 286 to 735, SEQ ID NO. 3 represents the 825 
nucleotide residues from 286 to 1 1 10, SEQ ID NO. 4 represents the 645 nucleotide 
residues from 91 to 735, SEQ ID NO 5 represents the 810 nucleotide residues from 91 to 
900 and SEQ ID NO. 6 represents the entire coding sequence of Lactobacillus brevis sip A 
gene from 1 to 1395 nucleotide residues. The first and last nucleotide residue of the 
nucleotide sequences of SEQ ID NO. 1 to SEQ ID NO. 6 is marked with an arrow in Figure 
5. 

The DNA molecule of this invention may be a DNA molecule encoding a polypeptide 
having the full length or the partial amino acid sequence i.e. the amino acid sequence 
contained in any one of SEQ ID NO. 7, SEQ ID NO. 8 , SEQ ED NO. 9, SEQ ID NO. 10 , 
SEQ ID NO 1 1 or SEQ ID NO 12, representing the various fragments of Lactobacillus 
brevis SlpA protein, excluding, however, the full length SEQ ID NO. 12. SEQ ID NO. 7 
represents the shortest amino acid sequence of 1 05 amino acid residues between 96 and 
200, SEQ ID NO. 8 represents 150 amino acid residues between 96 and 245, SEQ ID NO. 
9 represents 275 amino acid residues between 96 and 370, SEQ ID NO. 10 represents 215 
amino acid residues between 3 1 and 245 and SEQ ID NO 1 1 represents 270 amino acid 
residues between 31 and 300. SEQ ID NO 12 represents the amino acid sequence of the 
entire Lactobacillus brevis SlpA protein between 1 and 465. The first and last amino acid 
residue of the amino acid sequences of SEQ ID NO. 7 to SEQ ID NO. 12 is marked with a 
circle in Figure 5 . 
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By a partial amino acid sequence is meant an amino acid sequence lacking at least one 
amino acid compared to the amino acid sequences SEQ ID NO. 7 to SEQ ID NO. 12. 

The present invention furthermore relates to DNA molecules, the sequences of which 
5 differ from the sequences of the above-identified molecules due to degeneracy of the 
genetic code, and which code for a polypeptide capable of binding to human and/or animal 
epithelial cell types, like intestinal, urogenital and/or endothelial cells. 

The present invention relates also to DNA molecules, the sequences of which hybridize to 
10 any one of the DNA molecules above encoding a polypeptide capable of binding to human 
and/or animal epithelial cell types, like intestinal, urogenital and/or endothelial cells. 

The term "hybridization" in this context means hybridization under conventional 
hybridization conditions, preferably under stringent conditions such as described by, e.g. 
15 Sambrook et al. (1989, Molecular Cloning, A Laboratory Manual 2nd Edition, Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, NY). Typical stringent 
hybridization conditions are exemplified in example 7, but equal hybridizations can be 
carried out in slightly different conditions as is known to a person skilled in the art. 

2 0 In example 7 the slpA gene of L. brevis has been hybridized to the chromosomal DNA of 
other Lactobacillus strains. As can be seen from example 7, the hybridization method is 
very useful and reliable method to find new DNA molecules of this invention. All the L. 
brevis strains tested gave positive hybridisation signal except two strains, which were shown 
to lack the S-layer protein. Other S-protein expressing lactobacilli belonging to other 

2 5 lactobacilli species gave negative result. L. bucfmeri, which is closely related to L. brevis 

gave also positive signal. L. buchneri carries S-layer and has the capability of binding to 
human and/or animal epithelial cell types, like intestinal, urogenital and/or endothelial 
cells . 

3 0 These nucleic acid molecules that hybridize to the nucleic acid molecules of the present 
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invention can in principle be derived from any organism possessing such nucleic acid 
molecules. Preferably, they are derived from lactic acid bacteria or bifidobacteria. 
Nucleic acid molecules hybridizing to the nucleic acid molecules of the present invention 
can be isolated, e.g., from genomic libraries or cDNA libraries of various organisms. 

5 

Such nucleic acid molecules can be identified and isolated by using the nucleic acid 
molecules of the present invention or fragments of these molecules or the reverse 
. complements of these molecules, e.g. by hybridization according to standard techniques 
(see Sambrook el a/. (1989)). 

.10 

As hybridization probe, e.g. nucleic acid molecules can be used that have exactly or 
substantially the same nucleotide sequence indicated in the Figure 5 or fragments of said 
sequence. Preferably is used the entire nucleotide sequence of the coding sequence of the 
sip A gene. The fragments used as hybridization probes can also be synthetic fragments 
15 obtained by conventional synthesis techniques and the sequence of which is substantially 
identical to that of the nucleic acid molecules according to the invention. Once genes 
hybridizing to the nucleic acid molecules of the invention have been identified and 
isolated it is necessary to determine the sequence and to analyze the properties of the 
proteins coded for by said sequence. 

20 

The term "hybridizing DNA molecule" includes fragments, derivatives and allelic 
variants of the above-described nucleic acid molecules that code for the above-described 
protein (or its equivalent) or a biologically active fragment thereof. Fragments are 
understood to be parts of nucleic acid molecules long enough to code for the described 

2 5 protein (or its equivalent) or a biologically active fragment thereof. The term "derivative" 

means in this context that the nucleotide sequences of these molecules differ from the 
sequences of the above-described nucleic acid molecules in one or more positions and are 
highly homologous to said sequence. 

3 0 The present invention is directed also to DNA molecules which are homologous with the 

DNA molecules contained in the coding sequences of any of SEQ ID 1 to SEQ ID 6 or with 
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the DNA molecules encoding a polypeptide having the amino acid sequence contained in 
any of SEQ ID 7 to SEQ ID 12 or with the degenerated forms of these DNA molecules. 
"Homology" is understood to refer to a sequence identity of at least 50%, preferably 
more than 70% and still more preferably more than 90% on the length of at least 300 
5 nucleotides. The deviations from the nucleic acid molecules described above can be the 
result of deletion, substitution, insertion, addition or combination. 

Homology furthermore means that the respective nucleotide sequences or encoded 
proteins are functionally and/or structurally equivalent. The DNA molecules that are 

10 homologous tothe DNA molecules described above and that are derivatives of said DNA 
molecules are regularly variations of said molecules which represent modifications having 
the same biological function. They may be naturally occurring variations, such as 
sequences of other organisms or mutations. These mutations may occur naturally or may 
be achieved by specific mutagenesis. Furthermore, these variations may be synthetically 

15 produced sequences. 

The present invention furthermore relates to DNA molecules, the sequences of which 
have an amino acid sequence which shows at least 40 % identity, preferably at least 50 % 
identity, to a sequence contained above and which code for a polypeptide capable of 

2 0 binding to human and/or animal epithelial cell types, like intestinal, urogenital and/or 

endothelial cells. The amino acid sequences of this invention show identity of less, than 
36% compared to any known amino acid sequence. When comparing the L. brevis slpA 
protein amino acid sequence with amino acid sequences of the prior art S-layer proteins 
known from lactobacilli, it was found that the highest identity was below 36 %. 

25 

By an amino acid sequence that is an "equivalent" of a specific amino acid sequence is meant 
an amino acid sequence that is not identical to the specific amino acid sequence, but rather 
contains at least some amino acid changes (deletion, substitutions, inversions, insertions, 
etc.) that do not essentially affect the biological activity of the protein as compared to a 

3 0 similar activity of the specific amino acid sequence, when used for a desired purpose. The 

biological activity of a polypeptide means here the capability of binding to epithelial cells. 
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Preferably, an "equivalent" amino acid sequence contains at least 40% - 99% identity at the 
amino acid level to the specific amino acid sequence, most preferably at least 50%, more 
preferably at least 60% and in an especially highly preferable embodiment, at least 95% 
identity, at the amino acid level. 

5 

The term "binding" is used here to mean the adherence of a cell, protein, protein region or 
polypeptide reasonably firmly to an epithelial cell, like intestinal, urogenital and/ or 
endothelial cell type . According to this invention the binding capacity has been measured by 
determining the binding of polypeptides encoded by DNA molecules expressed as gene 
1 0 fusions in the variable region of the /7/C H7 gene of Escherichia coli. The resulting chimeric 
flagella were assessed by indirect immunofluorescence for binding to human intestinal and 
human urinary bladder cells. The binding capacity was visually characterized to be very . 
strong (++++), strong (+++), weak (+) or no binding at all (-). By binding is meant the 
adherence of proteins or cells to the epithelial cells, particularly to the surface of the cell. 

15 

The binding was specific: chimeric flagella harbouring inserts that represented the N- 
terminal part of the S-layer protein bound to intestinal as well as urinary bladder cell types, 
whereas the C-terminal part of the S-layer. protein did not confer binding on the chimeric 
flagella.The S-layer expressing bacterium Lactobacillus bi'evis ATCC 8287 efficiently 
2 0 adhered to the human small intestinal cell line and to the human urinary bladder cell line. 
Bacterial adhesiveness to both cell lines was completely abolished after removal of the S- 
layer protein (SlpA) from the bacterial surface by guanidine hydrochloride extraction. 

In Table I is shown the differences in binding of different strains of lactic acid bacteria to 
2 5 human epithelial cell line (Intestine 407). 
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Table I. Binding of different strains of lactic acid bacteria to human epithelial cell line 
Intestine 407. 

Strains Binding capacity 

5 • 

L acidophilus JCM1132 

L. crispatus JCM5810 +++* 





A296-21 


+ 


L. amylovorus 


F81 






JCM5807 


+ 


L. gallinarum . 


T-50 




L. gasseri 


JCM1130 


+ 


L. johnsonii 


5F49 


+ 


L. brevis 


ATCC 8287 


MM 



15 

* binding to the extracellular matrix secreted by the cell (see Toba et al, AppL Environ. 
Microbiol 61:2467-2471 (1995)) 

As can be seen in Table I, from the lactobacilli tested only L. brevis binds strongly to 
2 0 epithelial cells, particularly to the surface of the cell. 

By "lactic acid bacteria" are meant all Gram-positive; anaerobic, microaerophilic or aero- 
tolerant; catalase negative; rods or cocci; most importantly they all produce lactic acid as 
sole, major or important product from the energy-yielding fermentation of sugars. In 

2 5 practice, genuine members of lactic acid bacteria include at least the following 

genera: Lactobacillus, Lactococcus, Leuconostoc, Pediococcus, Carnobacterium, 
Sporolactobacillus, Streptococcus, Enter -ococcus, Aerococcus, Vagococcus, 
Tetragenococcus and Atopium. Many characteristics typical to genuine LAB are also 
common to the genus Bifidobacterium which consists of important health-promoting 

3 0 intestinal bacteria. 
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Transfer of the binding property to new host cells 

It is possible with the nucleotide sequences of this invention to modify or improve the 
binding capability of various prokaryotic or eukaryotic hosts to human and/or animal 
5 epithelial cell types, like intestinal, urogenital and/or endothelial cell types. The hosts cells 
are improved or modified to have the binding capability by transferring the host cells by at 
least one of the DNA molecules of this invention. 

The binding capability may be transferred to any suitable bacterial host, for example to 
10 strains of lactic acid bacteria or bifidobacteria or to a fungal strain, like to a yeast strain. 

A nucleotide sequence of this invention may be inserted into a DNA vector with 
conventional techniques, including blunt-ending or staggered-ending termini for ligation, 
restriction enzyme digestion to provide appropriate termini, filling in of cohesive ends as 
15 appropriate, alkaline phophatase treatment to avoid undesirable joining, and ligation with 
appropriate ligases. Techniques for such manipulations are described in Sambrook et al. 
(1989, Molecular Cloning, A Laboratory Manual 2nd Edition, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY). 

2 0 To express a desired coding sequence, transcriptional and translational signals recognizable 
by the host are necessary. The nucleotide sequences of this invention may be operably linked 
to the transcriptional and secretory regulatory elements in an expression vector, and 
introduced into a host cell to produce desired protein under the control of such sequences. 
A DNA molecule is said to be capable of expressing a polypeptide if it contains expression 

2 5 control sequences which contain transcriptional regulatory information and such sequences 
are operably linked to the nucleotide sequence which encodes the desired polypeptide. An 
operable linkage is a linkage in which a sequence is connected to a regulatory sequence in 
such a way as to place expression of the sequence under the influence or control of the 
regulatory sequence. 

30 

Where the protein or protein region expression and secretion control sequences do not 
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function satisfactorily in a host cell, then sequences functional in the host cell may be 
substituted as necessary. 



The vectors of the invention may comprise other operable linked functional elements such as 
5 DNA elements which confer antibiotic resistance on a host cell, and which provide for an 
origin of replication, or for insertion of a desired sequence into the chromosome of a host 
cell. 

To transform a host cell with the DNA constructs of the invention many vector systems are 
10 available depending upon whether it is desired to insert the desired protein's DNA construct 
into the host cell chromosomal DNA, or to allow it to exist in an extrachromosomal form. 

Cells which have stably integrated the introduced DNA into their chromosomes are selected 
by also introducing one or more markers which allow selection of host cells which contain 
15 the expression vector in the chromosome, for example the marker may provide resistance to 
antibiotics. The selectable marker gene (that can be later removed by methods well known in 
the art) can either be directly provided on the same vector as that providing the desired 
DNA gene sequences to be expressed, or such markers may be introduced into the same cell 
by co-transformation. 

20 

Factors of importance in selecting a particular plasmid or phage vector include the ease with 
which recipient cells that contain the vector may be recognized and selected from those 
recipient cells, which do not contain the vector and the number of copies of the vector 
which are desired in a particular host. 

25 

After the introduction of the vector, recipient cells are grown in a selective medium, which 
selects for the growth of vector-containing cells. Expression of the nucleotide sequences of 
this invention result in the production of the desired protein, or the production of a fragment 
of this protein and/or a host having the desired properties. 

30 

The above mentioned DNA molecules including the entire gene of L. brevis slpA can be 
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expressed on the surface of other selected well known probiotic microbes in order to 
enhance and target their adhesion to the host cells to allow more efficient colonization. 

Also the degree of the adhesion mediated by the polypeptides of this invention can be varied 
5 by modifying their expression by different copy numbers, promoter strength regulation or 
by changing the length and amino acid composition of the binding domain with genetic 
engineering. With amino acid modifications encoded by the DNA molecules of this invention 
. their host specificity may be affected if required.. 

10 Probiotics have been defined as live micro-organisms which beneficially affect the health of 
the host (human or animal ) by improving its intestinal balance. To date, the term probiotic 
has been widened to include also live preparations used genitourinary to prevent infections 
and restore the disturbed microbiological ecological balance. Different characteristics 
associated with potentially health affecting bacteria may include e.g. i) acid and bile 

15 stability, ii) adherence to intestinal cells, iii) colonization of the intestinal tract, iv) 

production of antimicrobial substances, v) antagonism against pathogenic bacteria and vi) 
safety in food and clinical use. 

According to this invention the probiotic effects of the hosts having (or modified to have) 
2 0 the binding capacity to epithelial cells can be enhanced by genetic means. 

Various hosts of this invention can be used to colonize the human or animal gastrointestinal 
or urogenital tract and to exclude pathogens by binding to cell receptors, which would 
otherwise be bound by pathogens or by growing to cell surface areas, which would 

2 5 otherwise be colonized by pathogens. 

As an example of establishing a surface expression of the polypeptides of this invention, 
one can apply well conserved cell wall binding anchors (e.g. C-terminal sequence of PrtP 
from L. lactis or other lactic acid bacteria, staphylococcal protein A, Streptococcus 

3 0 pyogenes M6 protein, yeast anchor sequences etc.), fused to a spacer region and a DNA 

molecule of this invention which is preceded by a suitable promoter and signal sequence (see 
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e.g. Piard et al. J. Bacterial 179:3068-3072 (1997) and Steidler et al Appl Environ. 
Microbiol. 64: 342-345 (1998). 



Enhancing properties of oral vaccine carriers 

5 

The above mentioned DNA molecules including the entire gene of L. brevis slpA can be 
transferred to another microorganism which could be for example a live vaccine carrier or a 
new putative host developed for vaccine purpose to enhance their adhesion properties. 

10 As described above the degree of adhesion can be Varied by modifying the expression of 
the DNA molecules of this invention by different copy numbers, promoter strength 
regulation or by changing the length and amino acid composition of the binding domain 
with genetic engineering. With amino acid modifications encoded by the DNA molecules of 
this invention their host specificity may be affected if required. 

15 

The portals of entry of many pathogens are mucosal surfaces and most infections also 
begin at some mucosal site. The natural immune response to a pathogen is also likely to 
begin at the mucosal site of entry. To induce such a response with a vaccine, a similar route 
for its delivery may be regarded as a rational approach. Most vaccines available to date are, 
2 0 however, injected parenterally giving systemic response alone which may not be long lasting 
and protective enough. Oral vaccines are most likely to prevent intestinal diseases as they 
stimulate mucosal associated lymphoid tissue in the gastrointestinal tract directly. Thus, 
new strategies to prevent the adhesion, multiplication and invasion of pathogens at 
mucosal surfaces have become more acute and in some instances the only way to prevent 

2 5 an infectious disease. Furthermore, the use of oral (or other mucosal) routes for 

immunization against infective diseases is desirable because oral vaccines are easier to 
administer, have higher compliance rates and are likely to be less expensive. Experimental 
data indicate that colonization is required for example for immune response when using 
recombinant Streptococcus gordonii expressing surface antigens (Fischetti et al Current 

3 0 Opinion in biotechnology 7:659-666 (1996) and Medaglini et al. Proc.NatlAcad.Sci USA 

92:6868-6872 (1995). 
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Mediating effective adherence to mucosal cells by the sequences of this invention, 
microorganisms carrying a homologous region to the identified binding region possess 
excellent properties as novel live oral vaccine carriers. Furthermore, it has been 
5 documented that surface (S-) layers function as effective adjuvants. In addition to the 
binding function, the C-terminal part of the SlpA gene can be modified to carry desired 
antigen epitopes and to efficiently present them as multiple copies at the S-layer formed by 
the SlpA subunits. Either a heterogenous or uniform S-layer can be formed depending on 
whether the host carries both an antigen-expressing and wild-type slpA gene or only the 
10 modified antigen-expressing gene. By this the amount of antigen on the cell surface may be 
affected. As described above, the vaccine antigen can also be expressed on the cell surface 
apart from a S-layer which in such a case functions as adhesion factor and adjuvant only. 

Preferably expression hosts for oral vaccine carriers are strains of lactic acid bacteria and 
15 bifidobacteria, more preferably L. brevis species, in particular the strain L. brevis 
ATCC8287. 

Transfer of the binding property to new proteins 

2 0 In addition to the use of the polypeptides of this invention responsible of the binding 

property in living vaccine carriers, also non-living applications for these structures are fully 
possible. The desired antigen polypeptides and shorter antigen epitopes having been 
modified to have the binding property of this invention can be expressed in the host chosen 
and administered after isolation and purification as pure protein preparations. If required, 

2 5 such preparations may be protected against degradation by using inert particles such as 
biodegradable microparticles, liposomes and cochelates (O'Hagan, Novel Delivery Systems 
for Oral Vaccines. CRC Press, Florida (1994)). 

30 
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Applications of the molecules of this invention in bifunctional target-specific 
biomolecules 

An unexplored but highly possible application for the DNA molecules of this invention are 
5 their use in bifunctional molecules i.e. by creating fusion constructs where the binding 
region mediates the adhesion to mucosal cells and the other domain functions as an active 
target-specific molecule. 

Such bifiinctionally acting molecules could be formed for example by combining the DNA 
10 molecules of this invention with DNA sequences encoding enzymes, single chain antibodies 
or pharmaceutical proteins or toxins. 

Targeted enzymes could be applied for example in the gut for degradation of (i) lactose by 
P-galactosidase to decrease adverse effects in lactose intolerance or ii) milk proteins by 
15 proteases and peptidases e.g. to release bioactive peptides or to increase tolerance against 
milk allergy. Certain toxins bound specifically to the polypeptides of this invention could 
also be used for destruction of pathogens at different mucosal sites. The delivery of the 
bifunctional molecule could be by direct spray or liquid preparations (nasal mucosa), by 
melting capsules (vaginal mucosa) or by inert particles described above for vaccine delivery 

2 0 systems (gastrointestinal tract). 

With single chain antibodies and pharmaceutical proteins linked to the binding regions of 
this invention unlimited amounts of applications for mucosal prevention and medication of 
infectious diseases can be predicted. 

25 

Use of Lactobacillus brevis as a probiotic host 

According to this invention L. brevis as such may function as a novel probiotic strain 
particularly due to its highly efficient binding capacity both in the gut and urinary tract even 

3 0 though its use in practical applications is yet unraveled . Its probiotic effect can be further 

enhanced for example by introducing genes encoding i) production of selected 
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bacteriocin(s) and the corresponding immunity and ii) other antimicrobial substances to 
antagonist pathogens and iii) enzymes increasing its metabolic activity towards available 
substrates to strengthen its competetiveness in the chosen niche. 

5 Also the degree of the Sip A mediated adhesion can be varied by modifying its expression 
by different copy numbers, by promoter strength regulation or by changing the length and 
amino acid composition of the binding domain with genetic engineering. With amino acid 
modifications of the SlpA binding domain their host specificity may be affected if required. 

10 In addition to the demonstrated slpA encoded binding capacity of I. brevis ATCC8287, this 
strain is also highly resistant in low pH and bile thus possessing the key characteristics of a 
probiotic strain. L. brevis strains have also been shown to have antagonistic effects against 
intestinal pathogens. Furthermore, L. brevis can be regularly found in the intestine of man 
and animals as well as in a variety of fermented food products. 

15 

The following examples and figures provide further details of the invention. 
Example 1 

2 0 Bactena-The strain ATCC 8287 of L. brevis and its slpA gene have been described ( 

Vidgren et al J. BacterioL 9:7419-7427 (1992)). The bacteria were grown overnight at 37 
°C in 20 ml of static MRS broth! To extract the S-layer (Masuda, K. & Kawata, T. 
Microbiol Immunol 23:941-953 (1979)), the bacterial cells were washed once with 
phosphate buffered saline, pH 7.1 (PBS) and once with distilled water, and the cells were 

2 5 then suspended in 0.8 ml of distilled water. The cells were then incubated for 2 h at 37 °C 

with 7.2 ml of 2 M guanidine hydrochloride; the control cells were incubated similarly in 
PBS alone. The guanidine hydrochloride-treated cells were washed once with 2 M guanidine 
hydrochloride and then twice with PBS; the control cells were washed twice with PBS. For 
adherence assays, the bacterial cells were suspended in the cell culture medium used with 

3 0 the target epithelial cell lines (see below). 
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Bacterial adherence assays-The bacterial adhesion to cultured human epithelial cells was 
evaluated essentially as described earlier ( Tarkkanen el aL, Infect Immunol 65:1546-1549 
(1997)). The human small intestine Intestine 407 (ATCC CCL6) cells were cultivated to 
confluence in RPMI 1640 medium (Gibco Laboratories, Grand Island, N.Y.) supplemented 
5 with 10 % (w/v) fetal calf serum (PAA Laboratories Gmbh, Linz, Austria), 1 % (w/v) L- 
glutamine (Life Technologies,), 1 % (w/v) nonessential amino acids (Gibco Laboratories, 
Grand Island, N.Y.), and gentamycin (50 p.g/ml). The human urinary bladder transitional 
T24 cells (ATCC HTB-4) were cultered in McCoy's 5A medium (Life Technologies) 
supplemented with fetal calf serum, L-glutamine, and gentamycin as above. The cell lines 

10 were cultured on diagnostic glass slides (Knittel Glassbearbeitungs GmbH, Braunschweig, 
Germany). Before the adhesion assays, the cells were washed once with PBS. The bacteria 
were suspended in RPMI 1640 medium at the concentrations ranging from 5 x 10 7 to 10 9 
cells/ml, and 40 \x\ of the suspension per well was added to the epithelial cells and the slides 
were incubated for 1 h at 3 7 °C in a moist chamber. The slides were washed five times at 

15 room temperature with PBS for five min each and fixed for 10 min with methanol. The cells, 
with adherent bacteria were then examined in a BX50 microscope (Olympus Optical Co., 
Hamburg, Germany) either directly by Nomarski interference optics (for photographing) or, 
for quantitative analysis, stained for 5 min with 10 % v/v Giemsa stain and analyzed by light 
microscopy. The number and standard deviation of adherent bacteria on 20 epithelial cells 

2 0 was calculated. In inhibition studies, purified Fab fragments from anti-SlpA immuno- 

globulins (Vidgren et aL J. Bacteriol 9:7419-7427 (1992)) or from control immuno- 
globulins raised against fimbrial proteins of Escherichia coli (Westerlund et aL ZbL Bakt. 
278:229-237 (1993)) were incubated with the bacterial suspensions for 30 min over crushed 
ice prior to the adhesion assay, the control bacteria were incubated similarly but in PBS 
25 alone. The Fab fragments were prepared by a routine procedure (Porter, Biochem. J. 
73:1 19-126 (1959)) and tested at the final concentration of 500 pig/ml. 

Example 2 

3 0 Flagella display-The principle of the flagella display system used here was recently 

described (Westerlund-Wikstrom et aL , Prot Engin. 10:1319-1326 (1997)). Fragments 
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representing different parts of the slpA gene were amplified by polymerase chain reaction 
(PCR) with Pfu polymerase and using chromosomal DNA from the L. brevis strain ATCC 
8287 as the template. The primers were designed on the basis of the nucleotide sequence of 
slpA ( Vidgren et al. J. Bacterial. 9:7419-7427 (1992)) and contained an^ccl restriction 
5 site at the 5 ' termini. The primers encoded the SlpA peptide sequences 3 1-245 (SEQ ID 
NO. 10; this peptide sequence is encoded by the nucleotide sequence SEQ ID NO. 4, which 
represents the nucleotide residues from 91-735), 3 1-300 (SEQ ID NO. 1 1 this peptide 
sequence is encoded by the nucleotide sequence SEQ ID NO. 5, which represents the 
nucleotide residues from 91-900), 96-200 (SEQ ID NO. 7; this peptide sequence is encoded 
.0 by the nucleotide sequence SEQ ID NO. 1, which represents the nucleotide residues 

from286-600), 96-245 (SEQ ID NO. 8; this peptide sequence is encoded by the nucleotide 
sequence SEQ ID NO. 2, which represents the nucleotide residues from286-735), 96-370 
(SEQ ID NO. 9; this peptide sequence is encoded by the nucleotide sequence SEQ ID NO. 
3, which represents the nucleotide residues from 286-11 1 0), or 239-447 (SEQ ID NO. 13; 
L 5 this peptide sequence is encoded by the nucleotide sequence SEQ ID NO. 14, which 

represents the nucleotide residues from 715-1341), where the residue numbers include the 
30-mer signal sequence of the SlpA peptide. The slpA fragments were cloned into the /led 
site in the plasmid pFHC H7 . deleted for 174 bp in the variable region oifliC and expressed in 
trans in£. coli that is fliC:.Tn\Q zndfimA::cat (Westerlund-Wikstrom et alProt. Eng. 
2 0 10: 13 19-1326(1997)). The flagellar filaments were extracted and, after a sodium 

dodecylsulfate gel electrophoresis (SD S-PAGE), adjusted to an equal concentration of the 
FliC peptide as described recently (Westerlund-Wikstrom el al, Prot. Engirt 10:1319-1326 
(1997)). The flagella lacking an insert, i.e. the AFliC filaments, were available from previous 
work (Westerlund-Wikstrom et a/.(1997)). 



25 



30 



Example 3 

Binding tests with chimeric flagella-Jht binding of the chimeric flagella onto the epithelial 
cells was assessed by indirect immunofluorescence as detailed recently (Westerlund- 
Wikstrom et a/.(1997)). Briefly, the epithelial cells were washed at room temperature with 
PBS, fixed with methanol for 10 min at -20 °C, and then washed with PBS at room 
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temperature. The flagellar extracts (40 jil; 20 jig/ml in PBS) were added and the slides were 
kept for 5 h at 4°C. After washing and a second fixing with methanol, the bound flagella 
were visualized by staining with immunoglobulin G molecules from an anti-H7-flagella 
rabbit antiserum and with fluorescein isothiocyanate-labelled secondary antibodies as 
5 detailed (Westerlund-Wikstrom et a/.(1997)). The control assays included staining of the 
epithelial cells as above but using the AFliC flagella lacking an insert, or omitting the 
flagellar extract, or the flagellar extract and the immunoglobulins in the staining procedure. 

Immunological methods-Tor immuno electron microscopy, bacterial cells expressing the 

1 0 various flagellar constructs were suspended in Luria broth and immobilized on copper grids 
coated with Pioloform and carbon. The bacteria were left to react with an anti-H7-flagella 
antiserum (Westerlund-Wikstrom et al (1997)); diluted 1/300 in PBS containing 10 mg/ml 
BSA or with an anti-SlpA antiserum (Vidgren et al J. Bacteriol 9:7419-7427 (1992)); 
diluted 1/300 in PBS-BSA) for 90 min at room temperature . The grids were washed in PBS 

15 containing 1 mg/ml BSA, and the bound antibodies were detected with Auroprobe™EM 
Protein A-conjugate (Amersham, Amersham Place; Little Chalfont, Buckinghamshire, UK; 
diluted 1/40). The grids were examined in a Jeol JEM-100CX transmission electron 
microscope at an operating voltage of 60 kV. For immunoblotting, flagellar preparations 
were analyzed by SDS-PAGE using a 1 % (w/v) stacking gel and a 10 % separating gel. 

2 0 Polypeptides were transferred onto a nitrocellulose membrane using a semi-dry transfer 
apparatus at 0.9 mA7cm 2 membrane for 2 h at 4 °C. After transfer, the membrane was 
quenched with PBS containing 20 mg/ml BSA for 16 h at 20 °C and washed with PBS. 
Polypeptides were visualized by staining with diluted polyclonal anti-flagella antibodies or 
anti-SlpA antibodies and alkaline-fosfatase-conjugated secondary antibodies as described 

2 5 (Westerlund-Wikstrom et al (1997)). A phosphatase substrate solution containing nitroblue- 
tetrazolium (162 ng/ml) and 5-bromo-4-chloro-3-indolyl-l -phosphate (370 jag/ml) was 
used. 

Example 4 

30 

Adherence ofL. brevis A TCC 8287 to intestinal cells-We initially assessed the adhesiveness 



BNSDOCID: <WO 9*951631 A1 . 1_> 



PCT/FI99/00290 

WO 99/51631 



24 

of the L. brevis strain ATCC 8287 to the human small intestine cell line Intestine 407. The 
strain showed an efficient adhesion to the intestinal cells (Fig. 1 A). The cells of I. brevis 
ATCC 8287 express the S-layer protein SlpA as their major cell surface protein (Vidgren et 
al. J. Bacterid . 9:7419-7427 (1992)), and we therefore extracted the S-layer from the 

5 bacterial surface and determined how this affected the adhesion. Extraction of cells with 2M 
guanidine hydrochloride is a routine procedure to remove bacterial S-layers, and the 
treatment does not lyse the bacterial cells. Treatment with guanidine hydrochloride 
completely abolished the adhesiveness of L. brevis (Fig. IB). The S-layer peptide was the 
' dominant peptide species released from the cells (Fig. 2B), and the results suggested a role 

10 for SlpA in bacterial adhesiveness. 

Example 5 

Expression ofslpA fragments mfliC-Tht slpA gene encoding the S-layer protein of L. 
15 brevis ATCC 8287 has been described (Vidgren et al. J. Bacterial. 9:7419-7427 (1992)). 
We cloned fragments of slpA into the Accl site in the plasmid pFHC H7A that contains a 174- 
bp deletion in the variable region of fliC, and the chimeric flagella were expressed in E coli 
JT1 that isJliC.TnlO zxAfimAv.cat. Schematic presentation of the slpA fragments 
expressed by the flagella display are shown in Fig.3 Western blots of the flagellar 
2 0 preparations with anti-H7 and anti-Sip A polyclonal antibodies showed that the apparent size 
of the chimeric flagellins corresponded to those predicted from the nucleotide sequence, i.e. 
it increased with the size of the insert in AfliC (data not shown). The polypeptides of smaller 
size that were present in the preparations and also reacted with the antibodies most likely 
were flagellar minor proteins (as noted earlier; Westerlund-Wikstrom et al, Prot. Engin. 
25 10:1319-1326 (1997)) or degradation products. We noted by electron microscopy (data not 
shown) that the chimeric flagella expressing the larger inserts (> 200 amino acids) had short 
flagellar filaments and thus these preparations were reduced in the relative amount of FliC as 
compared to the hook and cap proteins of the flagella. The chimeric flagella SlpA31- 
245/AFliC, SlpA31-300/AFliC, SlpA96-370/AFliC and SlpA239-447/AFliC reacted with 
3 o both the anti-H7 and the anti-SlpA antibodies, whereas the flagella with the two shortest 

inserts in the constructs SlpA96-245/AFliC and SlpA96-200/AFliC reacted with the anti-H7 
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antibodies and showed the expected apparent molecular size but failed to react with the anti- 
SlpA antiserum in Western blots. 

Example 6 

. 5 

Binding of chimeric flagella to epithelial cells-VJe next analyzed by an indirect 
immunofluorescence assay the binding of the chimeric flagella to Intestine 407 cells, 
representative examples of the assays are shown in Fig. 4. The SlpA3 1-300/AFliC construct, 
carrying the N-terminal part of the molecule, exhibited binding to the Intestine 407 cells, 

10 whereas the construct Sip A23 9-44 7/AFliC representing the C-terminal part, failed to bind 
(data not shown). No binding was observed with the AFliC construct lacking an insert. 
These results indicated that the binding region is located within the N-terminal part of the 
Sip A molecule. We constructed and tested various fragments covering different regions of 
the N-terminus, and the shortest fragment supporting adhesion to Intestine 407 cells was the 

15 construct SlpA96-200/AFliC that contained an 105 amino acid-long insert (Fig. 4 A). 

We also tested the binding of the SlpA96-200/AFliC construct to the human urinary bladder 

r 

cell line T24 (Fig. 4C). The chimeric flagella bound efficiently to the urinary bladder cells, 
whereas no binding of the AFliC flagella were observed. The L. brevis ATCC 8287 cells 

2 0 exhibited an efficient adhesion to the urinary bladder cell line, and the adhesiveness was 

abolished after treatment of the bacteria with guanidine hydrochloride (data not shown). It 
was also ahown that L. brevis ATCC 8277 cells bound to CaCo-2 cells. 

Example 7 

25 

Hybridization of the sip A gene ofL. brevis ATCC8287 to other Lactobacillus strains- 
The aim of this experiment was to study the presence of L, brevis sip-gene homologs in 
other Lactobacillus strains. For this purpose chromosomal DNA was isolated from the 
following L. brevis strains: all the six strains available in the DSM culture collections (DSM 

3 0 20054, 1267, 1268, 1269, 2647, 2647, 6235), five strains of the Japanese collection (Yasui 

et al FEMS Microbiology Letter 133:183-186 (1995), one strain (VK3) from the culture 
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collection of TNO (delivered by prof. Peter Powels), one strain of the University of Cornell , 
U.S.A. (delivered by prof. Carl Batt) and one own isolate from pig intestine identified by 
API asL. brevis/L. buchneri. In addition, several chromosomal DNA samples of S-layer 
carrying Lactobacilli isolated from pig intestine and identified by API as putative members 
5 of L. acidophilus, L. fermentum, L. crispatus and L. delbrueckii species were tested. 

Hybridization was performed in the following buffer: 5xSSC, 1% Blocking Reagent 
(Boehringer Mannheim), 0,02% SDS, 0,1 % layryl sarcosine at 68°C overnight using the full 
length L. brevis slpA gene labelled with DIG (digoxigenin- dUTP, Boehringer Mannheim) as 
1 0 probe. Hybridization was followed by washing steps in descreasing salt concentration and 
increasing temperature. The final wash step was in O.lxSSC, 0.1% SDS at 68°C for 15 mm 
twice. The negative control used was and equal amount of E.coli or calf thymus DNA to 
that of test DNA. 

15 All L. brevis strains tested gave a positive hybridization signal except the strain VK3 from 
TNO and the Japanese strain Yasui 0296961015. These two hybridization negative L. brevis 
strains were also shown to lack the S-layer protein by SDS-PAGE analysis. Furthermore, 
the other above mentioned S-layer expressing lactobacilli not belonging to the L. brevis 
and/or L. buchneri group remained negative in the hybridization test. 



20 



25 



We have also shown previoustly in WO 94/00581 that the chromosomal DNA of L. 
buchneri strain DSM 20057 strongly hybridizes with the sip-gene probe used above under 
stringent conditions, whereas S-layer protein carrying L. helveticus and L. acidophilus 
strains remains hybridization negative under same conditions. 

We have also shown that the L. brevis/L. buchneri strains hybridizing with the ATCC8287 
sip-probe also effectively bind to the Intestine 407 cells whereas non-S-layer carrying L. 
brevis strains do not adhere to Intestine 407 cells. 



30 
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SEQUENCE LISTING 



( 1 ) GENERAL INFORMATION : 



(i) APPLICANT: 

(A) NAME: Korhonen et al . 

(B) STREET : Huntutie 10 A 

(C) CITY: Helsinki 

(E) COUNTRY : Finland 

(F) POSTAL CODE (ZIP) : 00950 

TIT e^i?Ll INV T NTI ?? : A Pr ° tein regi ° n —P-sible of binding to 
epithelial cell types and a DNA sequence encoding said 
region 3 

(iii) NUMBER OF SEQUENCES: 16 

(iv) COMPUTER READABLE FORM: 

<A) MEDIUM TYPE: Floppy, disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS/MS - DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 

(vi) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: FI 980782. 
(B) FILING DATE: 03 -APR- 199 8 

(2) INFORMATION FOR SEQ ID NO : 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: • 
GTTAAGACCA CTAACCGTGG TTCAGTTTAC TACCGTGTTG TAACGATGGA TGGCAAGTAC 
CGTGGTTACG TTTATGGTGG CAAGTCTGAC ACTGCCTTTG CTGGTGGTAT CAAGTCTGCT 
GAAACGACTA CTAAGGCTGA TATGCCTGCA CGTACTACTG GGTTCTACTT AACTGACACT 
TCAAAGAACA CTCTTTGGAC GGCTCCTAAG TACACTCAAT ■ ACAAGGCAAG TAAAGTTAG C 
CTTTATGGTG TTG CTAAGG A CACCAAGTTT ACTGTAGATC AGGCTGCTAC TAAGACTCGT 
GAAGGTTCAT TATAC 

(2) INFORMATION FOR SEQ ID NO : 2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 450 base pairs 



60 
12 0 
180 
240 
300 
315 
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(B) TYPE : nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

5 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GTTAAGACCA CTAACCGTGG TTCAGTTTAC TACCGTGTTG TAACGATGGA TGGCAAGTAC 6 0 

CGTGGTTACG TTTATGGTGG CAAGTCTGAC ACTGCCTTTG CTGGTGGTAT CAAGTCTGCT 12 0 

GAAACGACTA CTAAGGCTGA TATGCCTGCA CGTACTACTG GGTT CTACTT AACTGACACT 18 0 

15 TCAAAGAACA CTCTTTGGAC GGCTCCTAAG TACACTCAAT ACAAGGCAAG TAAAGTTAGC 24 0 

CTTTATGGTG TTGCTAAGGA CACCAAGTTT ACTGTAGATC AGGCTGCTAC TAAGACTCGT 3 0 

GAAGGTTCAT TATACTATCA CGTAACTGCT ACTAACGGTA GTGGTATTAG TGGTTGGATT 36 0 

TACGCTGGTA AGGGCTTCAG TACTACTGCT ACTGG T AC AC AAGTACTTGG TGGTCTGTCA 42 0 

ACTGATAAGT C AGTT AC AG C AACCAACGAT- 450 
25 (2) INFORMATION FOR SEQ ID NO : 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: DNA (genomic) 
3 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



40 



20 



50 



GTTAAGACCA 


CTAACCGTGG 


TTCAGTTTAC 


TACCGTGTTG 


TAACGATGGA 


TGGCAAGTAC 


6 r 


CGTGGTTACG 


TTTATGGTGG 


CAAGTCTGAC 


ACTGCCTTTG 


CTGGTGGTAT 


CAAGTCTGCT 


120 


GAAACGACTA 


CTAAGGCTGA 


TATGCCTGCA 


CGTACTACTG 


GGTTCTACTT 


AACTGACACT 


180 


TCAAAGAACA 


CTCTTTGGAC 


GGCTCCTAAG 


TACACTCAAT 


ACAAGGCAAG 


TAAAGTTAGC 


240 


CTTTATGGTG 


TTGCTAAGGA 


CACCAAGTTT 


ACTGTAGATC 


AGGCTGCTAC 


TAAGACTCGT 


300 


GAAGGTTCAT 


TATACTATCA 


CGTAACTGCT 


ACTAACGGTA 


GTGGTATTAG 


TGGTTGGATT 


360 


TACGCTGGTA 


AGGGCTTCAG 


TACTACTGCT 


ACTGGTACAC 


AAGTACTTGG 


TGGTCTGTCA 


420 


ACTGATAAGT 


C AGTT AC AG C 


AACCAACGAT 


AAC AG TGTT A 


AGATTGTTTA 


CCGTAC G ACT 


480 


GATGGCACTC 


AAGTTGGTTC 


TAAC AC TTGG 


GTAACTTCAA 


CTGATGGTAC 


AAAGGCAGGT 


540 


TCTAAGGTAA 


GCGATAAGGC 


CGCCGATCAA 


ACTGCTCTTG 


AAGCCTACAT 


C AATG CT AAC 


600 


AAGCCTAGCG 


GTTACACTGT 


AACTAACCCT 


AATG CTGCAG 


ATGCTACCTA 


TGGTAACACA 


660 
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GTTTACGCTA CTGTTTCCCA AGCAGCTACT TCTAAGGTCG CTTTGAAGGT CTCAGGGACT 72 0 

CCTGTTACTA CTGCATTGAC TACAGCTGAT GCTAATGATA AGGTTGCAGC TAACGATACC 7 80 

5 ACTGCTAATG GTAGTTCTGT TGCAGGCTCA ACAGTCTATG CTGCT 82 5 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 645 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

15 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4: 

2 0 AAGTCATACG CTACTGCAGG TGCCTATTCA ACGTTAAAGA CGGACGCTGC TACTCGTAAC 60 

GTCGAAGCTA CTGGTACTAA CGCTTTATAC ACGAAGCCAG GTACTGTTAA GGGTGCTAAG 12 0 

GTTGTCGCTT CTAAGGCTAC TATGGCTAAG TTAGCTTCTT CAAAGAAGTC AG CTG ACTAC 18 0 

25 

TTCCGTGCTT ACGGTGTTAA GACCACTAAC CGTGGTTCAG TTTACTACCG TGTTGTAACG 24 0 

ATGGATGGCA AGTACCGTGG TTACGTTTAT GGTGG CAAGT CTGACACTGC CTTTGCTGGT 3 00 

3 0 GGTATCAAGT CTGCTGAAAC GACTACTAAG GCTGATATGC CTGCACGTAC TACTGGGTTC 3 60 

TACTTAACTG ACACTTCAAA GAACACTCTT TGGACGGCTC CTAAGTACAC TCAATACAAG 42 0 

G CAAGTAAAG TTAGCCTTTA TGGTGTTGCT AAGGACACCA AGTTTACTGT AGATCAGGCT 4 80 

35" 

G CT ACT AAG A CTCGTGAAGG TTCATTATAC TATCACGTAA CTGCTACTAA CGGTAGTGGT 54 0 

ATTAGTGGTT GGATTTACGC TGGTAAGGGC TTCAGTACTA CTGCTACTG G T AC AC AAG T A 600 

4 0 CTTGGTGGTC TGTCAACTGA TAAGTCAGTT ACAGCAACCA ACGAT 64 5 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

4 5 (A) LENGTH: 810 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

5 0 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5: 
5 5 AAGTCATACG CTACTGCAGG TGCCTATTCA ACGTTAAAGA CGGACGCTGC TACTCGTAAC 60 
GTCGAAGCTA CTGGTACTAA CGCTTTATAC ACGAAGCCAG GTACTGTTAA GGGTGCTAAG 12 0 
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GTTGTCGCTT 


CTAAGGCTAC 


TATGGCTAAG 


TTAGCTTCTT 


CAAAGAAGTC 




180 




TTCCGTGCTT 


ACGGTGTTAA 


GACCACTAAC 


CGTGGTTCAG 


TTT ACT AC CG 




£. *± \J 


5 


ATGGATGGCA 


AGTACCGTGG 


TTACGTTTAT 


GGTGGCAAGT 


CTGACACTGC 


CTTTGLlGb 1 


inn 
jUU 




GGTATCAAGT 


CTGCTGAAAC 


GACTACTAAG 


GCTGATATGC 


CTGCACGTAC 


TACTGGLil 


^ £ n 


10 


TACTTAACTG 


ACACTTCAAA 


GAACACTCTT 


TGGACGGCTC 


CTAAGTACAC 


T C AAT AC AAb 


4 U 


GCAAGTAAAG 


TTAGCCTTTA 


TGGTGTTGCT 


AAGGACACCA 


AGTTTACTGT 


AG ATC AGG CT 


4 80 




GCTACTAAGA 


CTCGTGAAGG 


TTCATTATAC 


TATCACGTAA 


CTGCTACTAA 


CGGTAGTGGT 


540 


1-5 


ATTAGTGGTT 


GGATTTACGC 


TGGTAAGGGC 


TTCAGT^CTA 


CTGCTACTGG 


TACACAAGTA 


600 




CTTGGTGGTC 


TGTCAACTGA 


TAAGTCAGTT 


ACAGCAACCA 


ACGATAACAG 


TGTTAAGATT 


661 


20 


GTTTACCGTA 


CGACTGATGG 


CACTCAAGTT 


GGTTCTAACA 


CTTGGGTAAC 


TTCAACTGAT 


720 


GGTACAAAGG 
TACATCAATG 


CAGGTTCTAA 
CTAACAAGCC 


GGTAAGCGAT 
TAGCGGTTAC" 


AAGGCCGCCG 


ATCAAACTGC 


TCTTGAAGCC 


780 
810 



2 5 (2)- INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 95 base pairs 

(B) TYPE: nucleic acid 

3 0 (C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: DNA (genomic) 

35 

(x) PUBLICATION INFORMATION: 

(A) AUTHORS: Vidgren , G 

Palva, I 
Pakkanen, R 

4 0 Lounatmaa , K 

Palva, A 

(B) "TITLE: S-Layer Protein Gene of Lactobacillus brevis : 

Cloning by Polymerase Chain Reaction and 
Determination of the Nucleotide Sequence 
45 (C) JOURNAL: J. Bacterid. 

(D) VOLUME : 17 4 

(E) ISSUE: 22 

(F) PAGES: 7419-7427 

(G) DATE: 1992 

50 (K) RELEVANT RESIDUES IN SEQ ID NO : 6: FROM 1 TO 1395 

(x) PUBLICATION INFORMATION: 

(H) DOCUMENT NUMBER: WO 94/00581 Al 

(I) FILING DATE: 24-JUN-1993 

55 (J) PUBLICATION DATE: 06-JAN-1994 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

SUBSTITUTE SHEET (RULE 25) 
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ATGCAATCAA GTTTAAAGAA ATCTCTTTAC TTGGGCCTTG CCGCATTGAG CTTTGCTGGT 6 0 

GTTGCTGCCG TTTCAACGAC TGCTTCAGCT AAGTCATACG CTACTGCAGG TGCCTATTCA 12 0 

5 ACGTTAAAGA CGGACGCTGC TACTCGTAAC GTCGAAGCTA CTGGTACTAA CGCTTTATAC 18 0 

ACGAAGCCAG GTACTGTTAA GGGTGCTAAG GTTGTCGCTT CTAAGGCTAC TATGG CTAAG 24 0 

TTAG CTTCTT CAAAGAAGTC AG CTG ACTAC TTCCGTGCTT ACGGTGTTAA GACCACTAAC 3 00 

10 

CGTGGTTCAG TTTACTACCG TGTTGTAACG ATGGATGGCA AGTACCGTGG TTACG TTTAT 3 60 

GGTGGCAAGT CTGACACTGC CTTTGCTGGT GGTATCAAGT CTGCTGAAAC GACTACTAAG 42 0 

15 GCTGATATGC CTG CACGTAC TACTGGGTTC TACTTAACTG ACACTTCAAA GAACACTCTT 4 80 

TGGACGGCTC CTAAG TACAC TCAATACAAG GCAAGTAAAG TTAGCCTTTA TGGTGTTGCT 54 0 

AAGGACACCA AGTTTACTGT AGATCAGGCT G CTACTAAG A CTCGTGAAGG TTCATTATAC 6 00 

2 0 

TATCACGTAA CTG CTACTAA CGGTAGTGGT ATTAGTGGTT GGATTTACGC TGGTAAGGGC . 660 

TTCAGTACTA CTGCTACTGG TACACAAGTA CTTGGTGGTC TGTCAACTGA TAAGTCAGTT 72 0 

25 ACAGCAACCA ACGATAACAG TGTTAAGATT GTTTACCGTA CGACTGATGG CACTCAAGTT 780 

GGTTCTAACA CTTGGGTAAC TT CAACTG AT GGTACAAAGG CAGGTTCTAA GGTAAGCGAT 84 0 

AAGGCCGCCG ATCAAACTGC TCTTGAAGCC TACATCAATG CTAACAAGCC TAGCGGTTAC 90 0 

30 

ACTGTAACTA ACCCTAATGC TGCAGATGCT ACCTATGGTA ACACAGTTTA CGCTACTGTT 96 0 

TCCCAAGCAG CTACTTCTAA GGTCGCTTTG AAGGTCTCAG GGACTCCTGT TACTACTGCA 102 0. 

3 5 TTGACTACAG CTGATGCTAA TGATAAGGTT GCAGCTAACG AT AC CACTG C TAATGGTAGT 108 0 

TCTGTTGCAG GCTCAACAGT CTATGCTGCT GGTACTAAGT TGGCTCAATT AACAACTGAC 114 0 

TTGACTGGTG AAAAGGGTCA AGTTGTCACA TTAACTGCCA TCGATACTGA TTTGGAAGAC 12 0 0 

40 

G CTACGTTC A CTGGAACTAC GACTTACTAT TCAGATCTTG GTAAAG C ATA CCACTACACT 12 60 

TACACTTACA ATAAGGACAG TGCTGCTTCT TCAAATGCAA GTACCCAATT TGGTTCAAAC 13 2 0 

4 5 GTCACTGGTA CTTTAACTGC TACCCTTGTT ATGGGTAAGT CTACTGCTAC TGCTAACGGT 13 8 0 

ACTACTTGGT TCAAC 13 95 

(2) INFORMATION FOR SEQ ID NO: 7: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105. amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

55 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Val Lys Thr Thr Asn Arg Gly Ser Val Tyr Tyr Arg Val Val Thr Met 
1 5 10 .15 

5 

Asp Gly Lys Tyr Arg Gly Tyr Val Tyr Gly Gly Lys Ser Asp Thr Ala 
20 25 30 

Phe Ala Gly Gly He Lys Ser Ala Glu Thr Thr Thr Lys Ala Asp Met 
10 ' 35 40 45 

Pro Ala Arg Thr Thr Gly Phe Tyr Leu Thr Asp Thr Ser Lys Asn Thr 
50 55 60 

15 Leu Trp Thr Ala Pro Lys Tyr Thr- Gin Tyr Lys Ala Ser Lys Val Ser 

65 70 75 80 



20 



25 



Leu Tyr Gly Val Ala Lys Asp Thr Lys Phe Thr Val Asp Gin Ala Ala 
85 90 95 

Thr Lys Thr Arg Glu Gly Ser Leu' Tyr 
100 105 

(2) INFORMATION FOR SEQ ID NO : 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 0 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

3 0 (D) TOPOLOGY: both 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



35 



Val Lys Thr Thr Asn Arg Gly Ser Val Tyr Tyr Arg Val Val Thr Met 
1 5 10 15 



Asp Gly Lys Tyr Arg Gly Tyr Val Tyr Gly Gly Lys Ser Asp Thr 'Ala 
40 20 25 30 

Phe Ala Gly Gly lie Lys Ser Ala Glu Thr Thr Thr -Lys Ala Asp Met 
35 - 40 45 

4 5 Pro Ala Arg Thr Thr Gly Phe Tyr Leu Thr Asp Thr Ser Lys Asn Thr 

50 55 60 

Leu Trp Thr Ala Pro Lys Tyr Thr Gin Tyr Lys Ala Ser Lys Val Ser 
65 70 75 80 



50 



Leu Tyr Gly Val Ala Lys Asp Thr Lys Phe Thr Val Asp Gin Ala Ala 
85 90. 95 



Thr Lys Thr Arg Glu Gly Ser Leu Tyr Tyr His Val Thr Ala Thr Asn 
55 100 105 110 

Gly Ser Gly lie Ser Gly Trp lie Tyr Ala Gly Lys Gly Phe Ser Thr 

SUBSTITUTE SHEET (RULE 26) 
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50 



115 120 125 

Thr Ala Thr Gly Thr Gin Val Leu Gly Gly Leu Ser Thr Asp Lys Ser 
130 135 . " 140 

Val Thr Ala Thr Asn Asp . 
145" 150 

(2) INFORMATION FOR SEQ ID NO : 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: -275 amino acids 

(B) TYPE: amino acid 

( C ) S TRANDEDNE S S : 
15 (D) TOPOLOGY: both 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



Val Lys Thr Thr Asn Arg Gly Ser Val Tyr Tyr Arg Val Val Thr Met 
15 io 15 



Asp Gly Lys Tyr Arg Gly Tyr Val Tyr Gly Gly Lys Ser Asp Thr Ala 

25 20 25 30 

Phe Ala Gly Gly lie Lys Ser Ala Glu Thr Thr Thr Lys Ala Asp Met 
35 40 45 

3 0 Pro Ala Arg Thr Thr Gly Phe Tyr Leu Thr Asp Thr Ser Lys Asn Thr 

50 55 60 



Leu Trp Thr Ala Pro Lys Tyr Thr Gin Tyr Lys Ala Ser Lys Val Ser 

65 70 75 80 

Leu Tyr Gly Val Ala Lys Asp Thr Lys Phe Thr Val Asp Gin Ala Ala 
85 90 95 



Thr Lys Thr Arg Glu Gly Ser Leu Tyr Tyr His Val Thr Ala Thr 1 Asn 
40 100 105 110 

. Gly Ser Gly lie Ser Gly Trp lie Tyr Ala Gly Lys Gly Phe Ser Thr 

115 120 125 

45 Thr Ala Thr Gly Thr Gin Val Leu Gly Gly Leu Ser Thr Asp Lys Ser 

13 0 13 5 14 0 



Val Thr Ala Thr Asn Asp Asn Ser Val Lvs He Val Tyr Arg Thr Thr 

145 150 155 " " 160 

Asp Gly Thr Gin Val Gly Ser Asn Thr Trp Val Thr Ser Thr Asp Gly 

165 170 175 



BNSDOCID: <WO 9&51631 A1 J_> 



Thr Lys Ala Gly Ser Lys Val Ser Asp Lys Ala Ala Asp Gin Thr Ala 
55 180 185 190 

Leu Glu Ala Tyr He Asn Ala Asn Lys Pro Ser Gly Tyr Thr Val Thr 
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195 200 205 

Asn Pro Asn Ala Ala Asp Ala Thr Tyr Gly Asn Thr Val Tyr Ala Thr 
210 215 220 

5 

Val Ser Gin Ala Ala Thr Ser Lys Val Ala Leu Lys Val Ser Gly Thr 
225 230 235 240 

Pro Val Thr Thr Ala Leu Thr Thr Ala Asp Ala Asn Asp Lys Val Ala 
10 245 250 255 

Ala Asn Asp Thr Thr Ala Asn Gly Ser Ser Val Ala Gly Ser Thr Val 
260 265 270 

15 Tyr Ala Ala 

275 

(2) INFORMATION FOR SEQ ID NO : 10: 

2 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 215 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: both 

25 

(ii) MOLECULE TYPE : peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

3 0 Lys Ser Tyr Ala Thr Ala Gly Ala Tyr Ser Thr Leu Lys Thr Asp Ala 

15 10 15 

Ala Thr Arg Asn Val Glu Ala Thr Gly Thr Asn Ala Leu Tyr Thr Lys 
20 25 30 

Pro Gly Thr Val Lys Gly Ala Lys Val Val Ala Ser Lys Ala Thr Met 
35 40 45 



35 



Ala Lys Leu Ala Ser Ser Lys Lys Ser Ala Asp Tyr Phe Arg Ala ! Tyr 
40 50 55 60 

Gly Val Lys Thr Thr Asn Arg Gly Ser Val Tyr Tyr Arg Val Val Thr 
65 70 75 80 

4 5 Met Asp Gly Lys Tyr Arg Gly Tyr Val Tyr Gly Gly Lys Ser Asp Thr 

8 5 9 0 95 



50 



Ala Phe Ala Gly Gly lie Lys Ser Ala Glu Thr Thr Thr Lys Ala Asp 

100 105 HO 

Met Pro Ala Arg Thr Thr Gly Phe Tyr Leu Thr Asp Thr Ser Lys Asn 

115 120 125 



Thr Leu Trp Thr Ala Pro Lys Tyr Thr Gin Tyr Lys Ala Ser Lys Val 
55 130 135 140 

Ser Leu Tyr Gly Val Ala Lys Asp Thr Lys Phe Thr Val Asp Gin Ala 
SUBSTIT UT E SHEET (RULE 25) 
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145 15 ° 155 160 

Ala Thr Lys Thr Arg Glu Gly Ser Leu Tyr Tyr His Val Thr Ala Thr 
' 165 170 175 

Asn Gly Ser Gly He Ser Gly Trp He Tyr Ala Gly Lys Gly Phe Ser 
180 185 190 

Thr Thr Ala Thr Gly Thr Gin Val Leu Gly Gly Leu Ser Thr Asp Lys 
10 195 200 * 205 

Ser Val Thr Ala Thr Asn Asp 
210 215 

15 (2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 amino acids 

(B) TYPE: amino acid 
2 0 .(C) STRAND EDNESS : 

(D) TOPOLOGY: both 



25 



30 



55 



<ii) MOLECULE TYPE: peptide ' 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



Lys Ser Tyr Ala Thr Ala Gly Ala Tyr Ser Thr Leu Lys Thr Asp Ala 
15 10 15 

Ala Thr Arg Asn Val Glu Ala Thr Gly Thr Asn Ala Leu Tyr Thr Lys 
20 25 30 

Pro Gly Thr Val Lys Gly Ala Lys Val Val Ala Ser Lys Ala Thr Met 
35 35 40 45 

Ala Lys Leu Ala Ser Ser Lys Lys Ser Ala Asp Tyr Phe Arg Ala Tyr 
50 55 60 

40 G1 Y Val L Y S Thr Thr Asn Arg Gly Ser Val Tyr Tyr Arg Val Val Thr 

65 7 0 75 ' 80 

Met Asp Gly Lys Tyr Arg Gly Tyr Val Tyr Gly Gly Lys Ser Asp Thr 
85 90 95 

Ala Phe Ala Gly Gly He Lys Ser Ala Glu Thr Thr Thr Lys Ala Asp 
100 105 no 

Met Pro Ala Arg Thr Thr Gly Phe Tyr Leu Thr Asp Thr Ser Lvs Asn 
50 H5 120 125 

Thr Leu Trp Thr Ala Pro Lys Tyr Thr Gin Tyr Lys Ala Ser Lys Val 
130 135 140 



Ser Leu Tyr Gly Val Ala Lys Asp Thr Lys Phe Thr Val Asp Gin Ala 
145 150 155 * 160 
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Ala Thr Lys Thr 



Asn Gly Ser Gly 
5 180 

Thr Thr Ala Thr 
195 

10 Ser Val Thr Ala 

210 

Thr Asp Gly Thr 
225 

15 

Gly Thr Lys Ala 
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Arg Glu Gly Ser Leu Tyr 
165 - 170 

lie Ser Gly Trp lie Tyr 
185 

Gly Thr Gin Val Leu Gly 
200 

Thr Asn Asp Asn Ser Val 
215 

Gin Val Gly Ser Asn Thr 
2 3-0 

Gly Ser Lys Val Ser Asp 
245 250 



Tyr His Val Thr Ala Thr 
175 

Ala Gly Lys Gly Phe Ser 
190 

Gly Leu Ser Thr Asp Lys 
205 

Lys lie Val Tyr Arg Thr 
220 

Trp Val Thr Ser Thr Asp 
235 240 

Lys Ala Ala Asp Gin Thr 
255 



Ala Leu Glu Ala Tyr lie Asn Ala Asn Lys Pro Ser Gly Tyr 
20 260 265 270 

(2) INFORMATION FOR SEQ ID NO : 12 : 

SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : both 

3 0 (ii) MOLECULE TYPE: protein 

(x) PUBLICATION INFORMATION: 

(A) AUTHORS: Vidgren, G 

Palva, I 

3 5 Pakkanen, R 

Lounatmaa, K 
Palva, A 

(B) TITLE: S-Layer Protein Gene of Lactobacillus brevis : 

Cloning by Polymerase Chain Reaction and 

4 0 Determination of the Nucleotide sequence 

(C) JOURNAL : J . Bacteriol . 

(D) VOLUME: 174 

(E) ISSUE: 22 

(F) PAGES : 7419-7427 
45 (G) DATE: 1992 

(K) RELEVANT RESIDUES IN SEQ ID NO: 12: FROM 1 TO 465 

PUBLICATION INFORMATION: 

(H) DOCUMENT NUMBER: WO 94/00581 Al 

(I) FILING DATE: 24-JUN-1993 
(J) PUBLICATION DATE: 06 - JAN- 1994 



(i) 



25 



(x) 



50 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

5 5 Met Gin Ser Ser Leu Lys Lys Ser Leu Tyr Leu Gly Leu Ala Ala Leu 

1 5 10 15 
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Ser Phe Ala Gly Val Ala Ala Val Ser Thr Thr Ala Ser Ala Lys Ser 
20 25 30 

Tyr Ala Thr Ala Gly Ala Tyr Ser Thr Leu Lys Thr Asp Ala Ala Thr 
5 35 40 45 

Arg Asn Val Glu Ala Thr Gly Thr Asn Ala Leu Tyr Thr Lys Pro Gly 

50 55 60 

10 Thr Val Lys Gly Ala Lys Val Val- Ala Ser Lys Ala Thr Met Ala Lys 

65 70 75 80 



15 



Leu Ala Ser Ser Lys Lys Ser Ala Asp Tyr Phe Arg Ala Tyr Gly Val 
85 90 95 

Lys Thr Thr Asn Arg Gly Ser Val Tyr Tyr Arg Val Val Thr Met Asp 
100 105 ( 110 

Gly Lys Tyr Arg Gly Tyr Val Tyr Gly Gly Lys Ser Asp Thr Ala Phe 
20 115 120 125 

Ala Gly Gly lie Lys Ser Ala Glu Thr Thr Thr Lys Ala Asp Met Pro 
130 135 140 

2 5 Ala Arg Thr Thr Gly Phe Tyr Leu Thr Asp Thr Ser Lys Asn Thr Leu 

145 . 150 155 160 



30 



45 



Trp .Thr Ala Pro Lys Tyr Thr Gin Tyr Lys Ala Ser Lys Val Ser Leu 
165 170 175 

Tyr Gly Val Ala Lys Asp Thr Lys Phe Thr Val Asp Gin Ala Ala Thr 
180 185 190 



Lys Thr Arg Glu Gly Ser Leu Tyr Tyr His Val Thr Ala Thr Asn Gly 

35 195 200 205 

Ser Gly lie Ser Gly Trp lie Tyr Ala Gly Lys Gly Phe Ser Thr Thr 

210 215 220 

4 0 Ala Thr Gly Thr Gin Val Leu Gly Gly Leu Ser Thr Asp Lys Ser Val 

225 230 235 240 



Thr Ala Thr Asn Asp Asn Ser Val Lys lie Val Tyr Arg Thr Thr Asp 
245 250 255 

Gly Thr Gin Val Gly Ser Asn Thr Trp Val Thr Ser Thr Asp Gly Thr 
260 265 270 



Lys Ala Gly Ser Lys Val Ser Asp Lys Ala Ala Asp Gin Thr Ala Leu 

50 275 280 285 

Glu Ala Tyr He Asn Ala Asn Lys Pro Ser Gly Tyr Thr Val Thr Asn 

290 295 300 

5 5 Pro Asn Ala Ala Asp Ala Thr Tyr Gly Asn Thr Val Tyr Ala Thr Val 

305 310 315 320 
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Ser.Gln Ala Ala Thr Ser Lys Val Ala Leu Lys Val Ser Gly Thr Pro 
325 330 335 

Val Thr Thr Ala Leu Thr Thr Ala Asp Ala Asn Asp Lys Val A.la Ala 
5 340 345 350 

Asn Asp Thr Thr Ala Asn Gly Ser Ser Val Ala Gly Ser Thr Val Tyr 
355 360 365 

1-0 Ala Ala Gly Thr Lys Leu Ala Gin Leu Thr Thr Asp Leu Thr Gly Glu 

370 375 380 



•15 



30 



40 



45 



Lys Gly Gin Val Val Thr Leu Thr Ala lie Asp Thr Asp Leu Glu Asp 

385 390 395 400 

Ala Thr Phe Thr Gly Thr Thr Thr Tyr Tyr Ser Asp Leu Gly Lys Ala 

405 410 415 



Tyr His Tyr Thr Tyr Thr Tyr Asn Lys Asp Ser Ala Ala Ser Ser Asn 
20 420 425 430 

Ala Ser Thr Gin Phe Gly Ser Asn Val Thr Gly Thr Leu Thr Ala Thr 
435 - - 440 445 

2 5 Leu Val Met Gly Lys Ser Thr Ala Thr Ala Asn Gly Thr Thr Trp Phe 

450 455 460 



Asn 
465 

(2) INFORMATION FOR SEQ ID NO: 13: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 9 amino acids 
35 (B) TYPE: amino acid 

( C ) STRAND EDNESS : 
(D) TOPOLOGY: both 



(ii) MOLECULE TYPE: peptide } 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 13: 

Ser Val Thr Ala Thr Asn Asp Asn Ser Val Lys lie Val Tyr Arg Thr 
15 10 15 

Thr Asp Gly Thr Gin Val Gly Ser Asn Thr Trp Val Thr Ser Thr Asp 
20 25 30 



Gly Thr Lys Ala Gly Ser Lys Val Ser Asp Lys Ala Ala Asp Gin Thr 
50 35 40 45 

Ala Leu Glu Ala Tyr lie Asn Ala Asn Lys Pro Ser Gly Tyr Thr Val 

50 55 60 

55 Thr Asn Pro Asn Ala Ala Asp Ala Thr Tyr Gly Asn Thr Val Tyr Ala 

65 70 75 80 
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Thr Val Ser Gin Ala Ala Thr Ser Lys Val Ala Leu Lys Val Ser Gly 
85 9 0 95 

Thr Pro Val Thr Thr Ala Leu Thr Thr Ala Asp Ala Asn Asp Lvs Val 
100 105 no 

Ala Ala Asn Asp Thr Thr Ala Asn Gly Ser Ser Val Ala Gly Ser Thr 
115 12 0 12 5 

Val Tyr Ala Ala Gly Thr Lys Leu Ala Gin Leu Thr Thr Asp Leu Thr 
130 135 140 

Gly Glu Lys Gly Gin Val Val Thr Leu Thr Ala lie Asp Thr Asp Leu 
145 150 155 160 

Glu Asp Ala Thr Phe Thr Gly Thr Thr Thr Tyr Tyr Ser- Asp Leu Gly 
165 170 175 



Lys Ala Tyr His Tyr Thr Tyr Thr Tyr Asn Lys Asp Ser Ala Ala Ser 
20 180 i 8 5 190 

Ser Asn Ala Ser Thr Gin Phe Gly Ser Asn Val Thr Gly Thr Leu Thr 
195 200 205 

2 5 Ala 

(2) INFORMATION FOR SEQ ID NO : 14: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 627 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : both 

(D) TOPOLOGY : both 

35 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

4 0 TCAGTTACAG CAACCAACGA TAACAGTGTT AAGATTGTTT ACCGTACGAC TGATGGCACT 6 0 

CAAGTTGGTT CTAACACTTG GGTAACTTCA ACTGATGGTA CAAAGGCAGG TTCTAAGGTA 12 0 

^ AG CGATAAGG CCGCCGATCA AACTGCTCTT GAAGCCTACA TCAATGCTAA CAAGCCTAGC 18 0 

GGTTACACTG TAACTAACCC TAATGCTGCA GATGCTACCT ATGGTAACAC AGTTTACGCT 24 0 

ACTGTTTCCC AAGCAGCTAC . TTCTAAGGTC GCTTTGAAGG TCTCAGGGAC TCCTGTTACT 3 00 

5 0 ACTGCATTGA CTACAGCTGA TGCTAATGAT AAGGTTGCAG CTAACGATAC CACTGCTAAT 3 60 

GGTAGTTCTG TTGCAGGCTC AACAGTCTAT GCTGCTGGTA CTAAGTTGGC TCAATTAACA 42 0 

ACTGACTTGA CTGGTGAAAA GGGTCAAGTT GTCACATTAA CTGCCATCGA TACTGATTTG 4 80 

5 5 

GAAGACGCTA CGTTCACTGG AACTACGACT TACT ATT C AG AT CTTGG TAA AG C ATACCAC 54 0 



SUBS Nil) lb SHEET (RULE 26) 

BNSDOCID: <WO W61631A1.I ^ 



WO 99/51631 



PCT/FI99/00290 



40 

TACACTTACA CTTACAATAA GGACAGTGCT GCTTCTTCAA ATGCAAGTAC CCAATTTGGT 60 0 

TCAAACGTCA CTGGTACTTT AACTGCT 627 
5 (2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH ': 1680 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS : both 

(D) TOPOLOGY: both 



15 



25 



55 



(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION:189 . . 1583 

2 0 (ix) FEATURE: . 

(A) NAME /KEY : misc_signal 

(B) LOCATION : 18 9 . .278 



(xi) SEQUENCE DESCRIPTION: SEQ -ID NO: 15: 

TCCAACGACA ATCAGAGCGT AATCCTTGTA TCTCCTTAAG GAAATCGCTA TACTTATCTT 6 0 

CGTAGTTAGG GGATAGCTGA TCGGGTCCGC TAATGTTATG AAATAAAATT CTTAACAAAA 12 0 

3 0 GCGCTAACTT CGGTTATACT ATTCTTGCTT GATAAATTAC ATATTTTATG TTTGGAGGAA 180 

GAAAGATT ATG CAA TCA AGT TTA AAG AAA TCT CTT TAC TTG GGC CTT GCC 23 0 

Met- Gin Ser Ser Leu Lys Lys Ser Leu Tyr Leu Gly Leu Ala 
1 5 10 ' 

35 

GCA TTG AGC TTT GCT GGT GTT GCT GCC GTT TCA ACG ACT GCT TCA GCT 2 7 F 

Ala Leu Ser Phe- Ala Gly Val Ala Ala Val Ser Tn.r Thr Ala Ser Ala 
15 20 * 25 30 

4 0 AAG TCA TAC GCT ACT GCA GGT GCC TAT TCA ACG TTA AAG ACG GAC GCT 32 6 

Lys Ser Tyr Ala Thr Ala Gly Ala Tyr Ser Thr Leu Lys Thr Asp Ala 
35 40 1 45 

GCT ACT CGT AAC GTC GAA GCT ACT GGT ACT AAC GCT TTA TAC ACG AAG 3 74 

45 Ala Thr Arg Asn Val . Glu Ala Thr Gly Thr Asn Ala Leu Tyr Thr Lys 

50 55 60 

CCA GGT ACT GTT AAG GGT GCT AAG GTT GTC- GCT TCT AAG GCT ACT ATG 422 
Pro Gly Thr Val Lys Gly Ala Lys- Val Val Ala Ser Lys Ala Thr Met 
50 65 70 75 

GCT AAG TTA GCT TCT TCA AAG AAG TCA GCT GAC TAC TTC CGT GCT TAC 4 70 

Ala Lys Leu Al a . Ser Ser Lys Lys Ser Ala Asp Tyr Phe Arg Ala Tyr 
80 85 90 



GGT GTT AAG ACC ACT AAC CGT GGT TCA GTT TAC TAC CGT GTT GTA ACG 518 
Gly Val Lys Thr Thr Asn Arg Gly Ser Val Tyr Tyr Axg Val Val Thr 
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95 ' 100 105 110 

ATG GAT GGC AAG TAC CGT GGT TAG GTT TAT GGT GGC AAG TC^ GAC ACT 56 6 

Met Asp Gly Lys Tyr Arg Gly Tyr Val Tyr Gly Gly Lys Ser Asp Thr 
115 .12 0 12 5 

GCC TTT GCT GGT GGT ATC AAG TCT GCT GAA ACG ACT ACT AAG GCT GAT 614 
Ala Phe Ala Gly Gly lie Lys Ser Ala Glu Thr Thr Thr Lys Ala Asp 
130 135 140 

ATG CCT GCA CGT ACT ACT GGG TTC TAC TTA ACT GAC ACT TCA AAG AAC 662 
Met Pro Ala Arg Thr Thr Gly Phe Tyr Leu Thr Asp Thr Ser Lys Asn 
145 150 155 



15 ACT CTT TGG ACG GCT CCT AAG TAC ACT. CAA TAC AAG GCA AGT AAA GTT 
Thr Leu Trp Thr Ala Pro Lys Tyr Thr Gin Tyr Lys Ala Ser Lys Val 
160 165 170 



20 



GCT ACT AAG ACT CGT GAA GGT TCA. TTA TAC TAT CAC GTA ACT GCT ACT 
Ala Thr Lys Thr Arg Glu Gly Ser Leu Tyr Tyr His Val Thr Ala Thr 
2-5 195 200 205 



30 



35 TCA GTT ACA GCA ACC AAC GAT AAC AGT GTT AAG ATT GTT TAC CGT ACG 
Ser Val Thr Ala Thr Asn Asp Asn Ser Val Lys lie Val Tyr Arg Thr 
240 245 250 



40 



50 



55 ACT GTT TCC CAA GCA GCT ACT TCT AAG GTC GCT TTG AAG GTC TCA GGG 
Thr Val Ser Gin Ala Ala Thr Ser Lys Val Ala Leu Lys Val Ser Gly 
320 325 33 0 
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AGC CTT TAT GGT GTT GCT AAG GAC ACC AAG TTT ACT GTA GAT CAG GCT 75 8 

Ser Leu Tyr Gly Val Ala Lys Asp Thr Lys Phe Thr Val Asp Gin Ala 
175 I 80 185 iso 



806 



AAC GGT AGT GGT ATT AGT GGT TGG ATT TAC GCT GGT AAG GGC TTC AGT 854 
Asn Gly Ser Gly He Ser Gly Trp He Tyr Ala Gly Lys Gly Phe Ser 
210 215 220 

ACT ACT GCT ACT GGT ACA CAA GTA CTT GGT GGT CTG TCA ACT GAT AAG 902 
.Thr Thr Ala Thr Gly Thr Gin Val Leu Gly Gly Leu Ser Thr Asp Lys 
225 230 235 



950 



ACT GAT GGC ACT CAA GTT GGT TCT AAC ACT TGG GTA ACT TCA ACT GAT ' 998 
Thr Asp Gly Thr Gin Val Gly Ser Asn Thr Trp Val Thr Ser Thr Asp 
255 260 265 27Q 



GGT ACA AAG GCA GGT TCT AAG GTA AGC GAT AAG GCC GCC GAT CAA ACT 104 6 

Gly Thr Lys Ala Gly Ser Lys Val Ser Asp Lys Ala Ala Asp Gin Thr 
45 275 280 285 

GCT CTT GAA GCC TAC ATC AAT GCT AAC AAG CCT AGC GGT TAC ACT GTA 1094 
Ala Leu Glu Ala Tyr He Asn Ala Asn Lys Pro Ser Gly Tyr Thr Val 
290 295 300 



ACT AAC CCT AAT GCT GCA GAT GCT ACC TAT GGT AAC ACA GTT TAC GCT 1142 
Thr Asn Pro Asn Ala Ala Asp Ala Thr Tyr Gly Asn Thr Val Tyr Ala 
305 310 315 



1190 
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ACT CCT GTT ACT ACT GCA TTG ACT ACA GCT GAT GCT AAT GAT AAG GTT 123 8 

Thr Pro Val Thr Thr Ala Leu Thr Thr Ala Asp Ala Asn Asp Lys Val 
335 340 345 350 

5 GCA GCT AAC GAT ACC ACT GCT AAT GGT AGT TCT GTT GCA GGC TCA ACA 12 8 6 

Ala Ala Asn Asp Thr Thr Ala Asn Gly Ser Ser Val Ala Gly Ser Thr 
355 360 365 

GTC TAT GCT GCT GGT ACT AAG TTG GCT CAA TTA ACA ACT GAC TTG ACT 13 34 

10 Val Tyr Ala Ala Gly Thr Lys Leu Ala Gin Leu Thr Thr Asp Leu Thr 
370 375 380 

GGT GAA AAG GGT CAA GTT GTC ACA TTA ACT GCC ATC GAT ACT GAT TTG 13 82 

Gly Glu Lys Gly Gin Val Val Thr Leu Thr Ala lie Asp Thr Asp Leu 
15 - 385 ' 390 -.. " 395 

GAA GAC GCT ACG TTC ACT GGA ACT ACG ACT TAC TAT TCA GAT CTT GGT 14 3 

Glu Asp Ala Thr Phe Thr Gly Thr Thr Thr Tyr Tyr Ser. Asp Leu Gly 

400 405 410 

20 

AAA GCA TAC CAC TAC ACT TAC ACT TAC AAT AAG GAC AGT GCT GCT TCT 1478 

Lys Ala Tyr -His Tyr Thr Tyr Thr Tyr Asn Lys Asp Ser Ala Ala Ser 

415 420 ' - 425 - - 430 

2 5 TCA AAT GCA AGT ACC CAA TTT GGT TCA AAC GTC ACT GGT ACT TTA ACT 15 2 6 

Ser Asn Ala Ser Thr Gin Phe Gly Ser Asn Val Thr Gly Thr Leu Thr 
435 440 445 

GCT ACC CTT GTT ATG GGT AAG TCT ACT GCT ACT GCT AAC GGT ACT ACT 157 4 

3 0 Ala Thr Leu Val Met Gly Lys Ser Thr Ala Thr Ala Asn Gly Thr Thr 

450 455 460 

TGG TTC AAC TAATAATTAT TATTTAGGTG AGCTTTGTTG ATAAAAAGGT 1623 
Trp Phe Asn 

3 5 465 

CTTTTCAACG TTTATGTTGG GGAGACCTTT TTATATTGAA AAAATTAGGC CTTTGTT 16 B. 

40 .(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 65 amino acids 

(B) TYPE: amino acid 

4 5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



50 



Met Gin Ser Ser Leu Lys Lys Ser Leu Tyr Leu Gly Leu Ala Ala Leu 
1 5 10 15 



Ser Phe Ala Gly Val Ala Ala Val Ser Thr Thr Ala Ser Ala Lys Ser 
55 20 25 30 

Tyr Ala Thr Ala Gly Ala Tyr Ser Thr Leu Lys Thr Asp Ala Ala Thr 
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35 40 45 

Arg Asn Val Glu Ala Thr Gly Thr Asn Ala Leu Tyr Thr Lys Pro Gly 
50 55 60 

5 

Thr Val Lys Gly Ala Lys Val Val Ala Ser Lys Ala Thr Met Ala Lys 
65 70 75 80 

Leu Ala Ser Ser Lys Lys Ser Ala Asp Tyr Phe Arg Ala Tyr Gly Val 
10 85 90 95 

Lys Thr Thr Asn Arg Gly Ser Val- Tyr Tyr Arg Val Val Thr Met Asp 
100 105 110 

15 Gly Lys Tyr Arg Gly Tyr Val Tyr Gly "Gly Lys Ser Asp Thr Ala Phe 
115 120 125 

Ala Gly Gly lie Lys Ser Ala Glu Thr Thr Thr Lys Ala Asp Met Pro 
130 135 140 

2 0 

Ala Arg Thr Thr Gly Phe Tyr Leu. Thr Asp Thr Ser Lys Asn Thr Leu 
145 150 155 160 

Tip Thr Ala Pro Lys Tyr Thr Gin Tyr Lys Ala Ser Lys Val Ser Leu 
25 165 170 175 

Tyr Gly Val Ala Lys Asp Thr Lys Phe Thr Val Asp Gin Ala Ala Thr 
180 185 190 

3 0 Lys Thr Arg Glu Gly Ser Leu Tyr Tyr His Val Thr Ala Thr Asn Gly 

195 200. 205 

Ser Gly lie Ser Gly Trp lie Tyr Ala Gly Lys Gly Phe Ser Thr Thr 
210 215 220 

35 

Ala Thr Gly Thr Gin Val Leu Gly Gly Leu Ser Thr Asp Lys Ser Val 
225 230 235 240 

Thr Ala Thr Asn Asp Asn Ser Val Lys lie Val Tyr Arg Thr Thr Asp 
40 245 250 255 

Gly Thr Gin Val Gly Ser Asn Thr Trp Val Thr Ser Thr Asp Gly Thr 
260 265 270 

4 5 Lys Ala Gly Ser Lys Val Ser Asp Lys Ala Ala Asp Gin Thr Ala Leu 

275 280 285 

Glu Ala Tyr lie Asn Ala Asn Lys Pro Ser Gly Tyr Thr Val Thr Asn 
290 295 300 

50 

Pro Asn Ala Ala Asp Ala Thr Tyr Gly Asn Thr Val Tyr Ala Thr Val 
305 310 315 320 

Ser Gin Ala Ala Thr Ser Lys Val Ala Leu Lys Val Ser Gly Thr Pro 
55 325 330 ^ 335 

Val Thr Thr Ala Leu Thr Thr Ala Asp Ala Asn Asp Lys Val Ala Ala 
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340 345 350 

Asn Asp Thr Thr Ala Asn Gly Ser Ser Val Ala Gly Ser Thr Val Tyr 
355 ' 360 365 

5 

Ala Ala Gly Thr Lys Leu Ala Gin Leu Thr Thr Asp Leu Thr Gly Glu 
370 3-75 380 

Lys Gly Gin Val Val Thr Leu Thr Ala lie Asp Thr Asp Leu Glu Asp 
• 10 385 390 395 400 

Ala Thr Phe Thr Gly Thr Thr Thr Tyr Tyr Ser Asp Leu Gly Lys Ala 
405 410 415 

15 Tyr His Tyr Thr Tyr Thr Tyr Asn Lys-Asp Ser Ala Ala Ser Ser Asn 
420 4'25 430 

Ala Ser Thr Gin Phe Gly Ser Asn Val Thr Gly Thr Leu Thr Ala Thr 
435 440 445 

20 

Leu Val Met Gly Lys Ser Thr Ala Thr Ala Asn Gly Thr Thr Trp Phe 
- 450 455 460 

Asn 
25 465 



i 
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What we claim is: 

1. A DNA molecule encoding a polypeptide capable of binding to human and/or animal 
5 epithelial cell types, said DNA molecule being selected from the group consisting of: 

(a) DNA molecules having at least the partial coding sequences of any one of SEQ ID 
NO. 1, SEQ ID NO. 2, SEQ ID NO. 3, SEQ ID NO. 4, SEQ ID NO 5 and/or SEQ ID NO. 
6 excluding the full length SEQ ID NO 6; -•■ 

10 (b) DNA molecules encoding a polypeptide having at least the partial amino acid sequences 
of any one of SEQ ID NO. 7, SEQ ID NO. S, SEQ ED NO.9, SEQ ID NO. 10, SEQ ID 
NO. 1 1 and/or SEQ ID NO 12, excluding the full length SEQ ED NO 12; 
(c) DNA molecules the coding sequences of which differ from the coding sequence of a 
nucleic acid molecule of (a) or (b) due to the degeneracy of the genetic code; 

15 (d) DNA molecules hybridizing under stringent conditions to a molecule of (a), (b) and/or 
(c) excluding the full-length SEQ ED NO 6 ; and 

(e) DNA molecules encoding a polypeptide capable of binding to human and/or animal 
epithelial cell types and having an amino acid sequence which shows at least 40 % identity 
to a sequence contained in (b) excluding the full length SEQ ID NO 12. 

20 

2. The DNA molecule of claim 1 encoding a polypeptide capable of binding to human and/ 
or animal epithelial cell types, wherein said polypeptide is capable of binding to intestinal, 
urogenital and/or endothelial cells. 

2 5 3. The DNA molecule of claim 1 or 2, which originates from the DNA molecule encoding 

Lactobacillus brevis S-layer SlpA protein. 

4. A vector containing a DNA molecule of any one of claim 1, 2 or 3. 

3 0 5.The vector of claim 4, in which the DNA molecule is operably linked to expression and 

optionally to secretion control sequences allowing expression in prokaryotic or eukaryotic 
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host cells. 

6. A host cell transformed with a DNA molecule of any one of claims 1, 2 or 3 or with a 
vector of claim 4 or 5. 

5 

7. A method of constructing a host cell capable of binding to human and/or animal 
epithelial cell types, like intestinal, urogenital and/or endothelial cell types, comprising 
transforming the cell with at least one DNA molecule selected from the group consisting 
of: 

10 (a) DNA molecules having at least the partial coding sequences of any one of SEQ ID Nt 
1, SEQ ID NO. 2, SEQ ID NO. 3, SEQ ID NO. 4, SEQ ID NO 5, and/or SEQ ID NO 6; 

(b) DNA molecules encoding a polypeptide having at least the partial amino acid sequences 
of any one of SEQ ED NO. 7, SEQ ID NO. 8, SEQ ID NO. 9, SEQ ID NO. 10, SEQ ID NO. 
11 and/or SEQ ID NO 12 ; 

15 (c) DNA molecules, the coding sequence of which differ from the coding sequence of a 
DNA molecule of (a) or (b) due to the degeneracy of the genetic code; 

(d) DNA molecules hybridizing under stringent conditions to a molecule of (a), (b) and/or 

(c) ; and 

(e) DNA molecules encoding a polypeptide capable of binding to human and/or animal 

2 0 epithelial cell types, like intestinal, urogenital and/or endothelial cell types and having an 
amino acid sequence which shows at least 40 % identity to a sequence contained in (b). 

8. A host cell constructed by the method of claim 7. 

2 5 9. The host cell of any one of claims 6 or 8, which has probiotic effects. 

10. The host cell of claim 9, wherein the probiotic effects have been enhanced by genetic 
means. 

3 0 11. The host cell of claim 9 or 10, which belongs to lactic acid bacteria or bifidobacteria. 

12. The host cell of any one of claims 6 or 8 tol 1, wherein the host cell is a vaccine carrier. 
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13. The host cell of any one of claims 6 or 8 to 12, wherein the host cell has been 
genetically modified to carry at least one of the factors selected from the group consisting 
of an antigen, an epitope, a single chain antibody, an enzyme, a pharmaceutical protein and 
a toxin. 

5 

14. A method of constructing a polypeptide capable of binding to human and/or animal 
epithelial cell types, like intestinal, urogenital and/or endothelial cell types, comprising 
modifying the gene encoding the polypeptide with a DNA molecule of claim 1 or 7. 

10 15. A gene encoding a preselected protein, wherein the gene encoding the protein is 

genetically modified to bind to human and/or animal epithelial cell types, like intestinal, 
urogenital and/ or endothelial cell types with at least one DNA molecule selected from the 
group consisting of any of the sequences of claim 1 or 7. 

15 16. The gene of claim 15, wherein the preselected protein is selected from the group 

consisting of an antigen, an epitope, a single chain antibody, an enzyme, a pharmaceutical 
protein and a toxin. 

17. A polypeptide encoded by a DNA molecule of claim 1 or 15. 

20 

18. A Lactobacillus brevis strain for carrying preselected factors/ properties to human 
and/or animal epithelial cells or cell surfaces, wherein the strain has the capability of » 
binding to human and/or animal epithelial cell types and wherein the gene encoding the S- 
layer SlpA protein is genetically modified to cany at least one of the factors/properties 

2 5 selected from the group consisting of an antigen, an epitope, a single chain antibody, an 

enzyme, a pharmaceutical protein and a toxin. 

19. A Lactobacillus brevis strain for carrying preselected factors/ properties to human 
and/or animal epithelial cells or cell surfaces, wherein the strain has the capability of 

3 0 binding to human and/or animal epithelial cell types and wherein the strain is genetically 

modified to have improved probiotic effects. 
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20. A Lactobacillus brevis S-layer SlpA protein for carrying preselected factors/ 
properties to human and/or animal epithelial cells or cell surfaces, wherein the protein has 
the capability of binding to human and/or animal epithelial cell types and wherein the gene 
encoding the S-layer SlpA protein is genetically modified to carry at least one of the 

5 factors/properties selected from the group consisting of an antigen, an epitope, a single 
chain antibody, an enzyme, a pharmaceutical protein and a toxin. 

21 . A method for carrying preselected factors/ properties to human and/or animal epithelial 
cells or cell surfaces by using a Lactobacillus brevis strain, wherein the strain has the 

10 capability of binding to human and/or animal epithelial cell types and wherein the gene 
encoding the S-layer SlpA protein is genetically modified to carry at least one of the 
factors/properties selected from the group consisting of an antigen, an epitope, a single 
chain antibody, an enzyme, a pharmaceutical protein and a toxin. 

15 22. A method for carrying preselected factors/ properties to human and/or animal epithelial 
cells or cell surfaces by using a Lactobacillus brevis strain, wherein the strain has the 
capability of binding to human and/or animal epithelial cell types and wherein the strain is 
genetically modified to have improved probiotic effects. 

2 0 23. A method for carrying preselected factors/ properties to human and/or animal epithelial 
cells or cell surfaces by using the Lactobacillus brevis S-layer SlpA protein, wherein the 
protein has the capability of binding to human and/or animal epithelial cell types and 
wherein the gene encoding the S-layer SlpA protein is genetically modified to carry at least 
one of the factors/properties selected from the group consisting of an antigen, an 

2 5 epitope, a single chain antibody, an enzyme, a pharmaceutical protein and a toxin. 

24. The use of the host cells of any one of claims 6 or 8 to 13 for excluding pathogens in 
the cell surfaces of the gastrointestinal or urogenital tract of humans and/ or animals. 
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Expression of sip A fragments as fusions to fliC 
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TCCAACGACAATCAGAGCGTAATCCTTGTATCTCCTTAAGGAAATCGCTATACTTATCTT 



1 1 I I I.I 

CGTAGTTAGGGGATAGCTGATCGGGTCCGCTAATGTTATGAAATAAAATTCTTAACAAAA 



I I II | 

GCGCTAACTTCGGTTATACTATTCTTGCTTGAT.AAATTACATATTTTATGTTTGGAGGAA 



4, 1 1 I I I I 

GAAAGATTATGCAATCAAGTTTAAAGAAATCTCTTTACTTGGGCCTTGCCGCATTGAGCT 

(m}qsslkk-slylg laals 



( slpA coding region 



ttgctggtgttgctgccgtttcaacgactgcttcagctaagtcatacgctactgcaggtg 

© 

31 



fagvaavs t t a s a © s y . a tag 



I I I I I I 

cctattcaacgttaaagacggacgctgctactcgtaacgtcgaagctactggtactaacg 
ay.stlktdaatrnveatgtn 



1 I I I.I I 

ctttatacacgaagccaggtactgttaagggtgctaaggttgtcgcttctaaggctacta 

A L y T K PGTV KG A KV VA S KAT 



I I I I I 2 l 6 , 

TGGCTAAGTTAGCTTCTTCAAAGAAGTCAGCTGACTACTTCCGTGCTTACGGTGTTAAGA 
MAKLASSKKSADY F R A Y G ® K 

96 



Fig. 5A 
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CCACTAACCGTGGTTCAGTTTACTACCGTGTTGTAACGATGGATGGCAAGTACCGTGGTT 
T T .N R G S V ' Y Y R V V T M DG PC Y R G 



I I t I 1 I 
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